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ISOLATED MAMMALIAN DENDRITIC CELL GENES; RELATED REAGENTS 

This filing is a conversion of U.S. Provisional Patent 
Applications DX0669P, USSN 30/031,806, filed November 27, 
1996, and DX0669P1, USSN 6o/o32,767, filed December 11, 

1996, each of which is incorporated herein by reference, to 
a regular utility patent application. 


FIELD OF THE INVENTION 

The present invention contemplates compositions 
related to genes found in dendritic cells, cells which 
function in the immune system. These genes function in 
controlling development, differentiation, and/or physiology 
of mammalian immune system. In particular, the application 
provides nucleic acids, proteins, antibodies, and methods 
of using them. 


BACKGROUND OF THE INVENTION 

The circulating component of the mammalian circulatory 
system comprises various cell types, including red and 
white blood cells of the erythroid and myeloid cell 
lineages. See, e.g., Rapaport. (1987) Introduction to 
Hematology (2d ed.) Lippincott, Philadelphia, PA; Jandl 
(1987) Blood: Textbook of Hematology . Little, Brown and 
Co., Boston, MA.; and Paul (ed.) (1993) Fundamental 

Immunology (3d ed.) Raven Press, N.Y. 

Dendritic cells are antigen-presenting cells, and are 
found in all tissues of the body. They can be classified 
into various categories, including: interstitial dendritic 
cells of the heart, kidney, gut, and lung; Langerhans cells 
in the skin and mucous membranes; mterdigitating dendritic 
cells in the thymic medulla and secondary lymphoid tissue; 
and blood and lymph dendritic cells. Although dendritic 
cells in each of these compartments are CD45+ leukocytes 
that apparently arise from bone marrow, they may exhibit 
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differences that relate to maturation state and 
microenvironment. 

Dendritic cells (DC), which are specialized antigen- 
presenting cells, efficiently process and present antigens 
to, e.g., T cells. They stimulate responses from naive and 
memory T cells in the paracortical area of secondary 
lymphoid organs. There is some evidence for a role in 
induction of tolerance. 

The primary and secondary B-cell follicles contain 
follicular dendritic cells that trap and retain intact 
antigen as immune complexes for long periods of time. 

These dendritic cells present native antigen to B cells and 
are likely to be involved being the affinity maturation of 
antibodies, the generation of immune memory, and the 
maintenance of humoral immune responses. 

However, dendritic cells are poorly characterized, 
both in terms of proteins they express, and many of their 
functions and mechanisms of action. The absence of 
knowledge about the structural, biological, and 
physiological properties of these cells limits their 
understanding. Thus, medical conditions where regulation, 
development, or physiology of dendritic cells is unusual 
remain unmanageable. 


SUMMARY OF THE INVENTION 

The present invention is based, in part, upon the 
discovery of three clones isolated from activated dendritic 
cells, which identify mammalian genes of structural and 
functional relationship. It embraces agonists and 
antagonists of these molecules designated A05F12 
(diubiquitin), A07C03 (Ig family gene), and E02B02 (LAMP- 
like gene), e.g., mutations (muteins) of the natural 
sequences, fusion proteins, chemical mimetics, antibodies, 
and other structural or functional analogs. It is also 
directed to isolated genes encoding proteins of the 
invention. Various uses of these different protein or 
nucleic acid composition are also provided. 
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The present invention provides a composition of matter 
selected from: a substantially pure or recombinant A05F12 
protein or peptide exhibiting at least about 85% sequence 
identity over a length of at least about 12 ammo acids to 
SEQ ID NO: 2 or 4; a natural sequence A05F12 comprising SEQ 
ID NO: 2 or 4; a fusion protein comprising A05F12 sequence; 
a substantially pure or recombinant A07C03 protein or 
peptide exhibiting at least about 85% sequence identity 
over a length of at least about 12 amino acids to SEQ ID 
NO: 6, 8, or 10; a natural sequence A07C03 comprising SEQ 

ID NO: 6, 8, or 10; a fusion protein comprising A07C03 

sequence; a substantially pure or recombinant E02B02 
protein or peptide exhibiting at least about 85% sequence 
identity over a length of at least about 12 amino acids to 
SEQ ID NO: 12; a natural sequence E02B02 comprising SEQ ID 
NO: 12; or a fusion protein comprising E02B02 sequence. In 
certain preferred embodiments, the substantially pure or 
isolated protein will comprise a segment exhibiting 
sequence identity to a corresponding portion of an: A05F12, 
wherein: the homology is at least about 90% identity and 
the portion is at least about 9 amino acids; the homology 
is at least about 80% identity and the portion is at least 
about 17 amino acids; or the homology is at least about 70% 
identity and the portion is at least about 25 amino acids; 
A07C03, wherein: the homology is at least about 90% 
identity and the portion is at least about 9 amino acids; 
the homology is at least about 80% identity and the portion 
is at least about 17 amino acids; or the homology is at 
least about 70% identity and the portion is at least about 
25 amino acids; or E02B02, wherein: the homology is at 
least about 90% identity and the portion is at least about 
9 amino acids; the homology is at least about 80% identity 
and the portion is at least about 17 amino acids; or the 
homology is at least about 70% identity and the portion is 
at least about 25 amino acids. Other preferred embodiments 
include where: A05F12 comprises a mature sequence of Table 
1; the A05F12 protein or peptide: is from a warm blooded 
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said identity 
rodent A05F12 


animal selected from a primate or rodent, such as a human 
or mouse; comprises at least one polypeptide segment of SEQ 
ID NO: 2 or 4; exhrbits a plurality of portions exhibiting 

is a natural allelic variant of a primate or 
has a length at least about 30 amino acids; 
exhibits at least two non-overlapping epitopes which are 
specific for a primate or rodent A05F12; exhibits a 
sequence identity at least about 90% over a length of at 
least about 20 amino acids to a primate or rodent A05F12; 
has a molecular weight of at least 100 kD with natural 
glycosylation; is a synthetic polypeptide; is attached to a 
solid substrate; is conjugated to another chemical moiety; 
is a 5-fold or less substitution from natural sequence,- or 
is a deletion or insertion variant from a natural sequence; 
A07C03 comprises a mature sequence of Table 2; the A07C03 
protein or peptide: is from a warm blooded animal selected 
from a primate or rodent, such as a human or mouse; 
comprises at least one polypeptide segment of SEQ ID NO: 8 
or 10; exhibits a plurality of portions exhibiting said 
identity; is a natural allelic variant of a primate or 
rodent A07C03; has a length at least about 30 amino acids; 
exhibits at least two non-overlapping epitopes which are 
specific for a primate or rodent A07C03; exhibits a 
sequence identity at least about 90% over a length of at 
least about 20 amino acids to a primate or rodent A07C03; 
has a molecular weight of at least 100 kD with natural 
glycosylation; is a synthetic polypeptide; is attached to a 
solid substrate; is conjugated to another chemical moiety; 
is a 5-fold or less substitution from natural sequence; or 
is a deletion or insertion variant from a natural sequence; 
E02B02 comprises a mature sequence of Table 3; or the 
E02B02 protein or peptide: is from a warm blooded animal 
selected from a primate, such as a human; comprises at 
least one polypeptide segment of SEQ ID NO: 12; exhibits a 
plurality of portions exhibiting said identity; is a 
natural allelic variant of a primate E02B02; has a length 
at least about 30 amino acids; exhibits at least two non- 
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overlapping epitopes which are specific for a primate 
E02B02; exhibits a sequence identity at least about 90% 
over a length of at least about 20 amino acids to a primate 
E02B02; has a molecular weight of at least 100 kD with 
natural glycosylation; is a synthetic polypeptide; is 
attached to a solid substrate; is conjugated to another 
chemical moiety; is a 5-fold or less substitution from 
natural sequence; or is a deletion or insertion variant 
from a natural sequence. Other preferred embodiments 
include a composition comprising: a sterile A05F12 protein 
or peptide; the A05F12 protein or peptide and a carrier, 
wherein the carrier is: an aqueous compound, including 
water, saline, and/or buffer; and/or formulated for oral, 
rectal, nasal, topical, or parenteral administration; a 
sterile A07C03 protein or peptide; the A07C03 protein or 
peptide and a carrier, wherein the carrier is: an aqueous 
compound, including water, saline, and/or buffer; and/or 
formulated for oral, rectal, nasal, topical, or parenteral 
administration; a sterile E02B02 protein or peptide; or the 
E02B02 protein or peptide and a carrier, wherein the 
carrier is: an aqueous compound, including water, saline, 
and/or buffer; and/or formulated for oral, rectal, nasal, 
topical, or parenteral administration. 

In fusion protein embodiments, the invention provides 
such comprising: mature protein sequence of Table 1, 2, or 

3; a detection or purification tag, including a FLAG, His6, 
or Ig sequence; or sequence of another receptor protein. 

Kit embodiments include those comprising a protein or 
polypeptide as described, and: a compartment comprising the 
protein or polypeptide; and/or instructions for use or 
disposal of reagents in the kit. 

Other aspects of the invention include a binding 
compound comprising an antigen binding site from an 
antibody, which specifically binds to a natural: A05F12 
protein, wherein: the protein is a primate or rodent 
protein; the binding compound is an Fv, Fab, or Fab2 
fragment; the binding compound is conjugated to another 
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chemical moiety; or the antibody: is raised against a 
peptide sequence of a mature polypeptide of Table 1; is 
raised against a mature primate or rodent A05F12; is raised 
to a purified human A05F12; is raised to a purified mouse 
5 AO5FI2; is lmmunoselected; is a polyclonal antibody; binds 

to a denatured A05F12; exhibits a Kd to antigen of at least 
30 |iM; is attached to a solid substrate, including a bead 
or plastic membrane; is in a sterile composition; or is 
detectably labeled, including a radioactive or fluorescent 
10 label; A07C03 protein, wherein: the protein is a primate or 

rodent protein; the binding compound is an Fv, Fab, or Fab2 
fragment; the binding compound is conjugated to another 
chemical moiety; or the antibody: is raised against a 
peptide sequence of a mature polypeptide of Table 2; is 
15 raised against a mature primate or rodent A07C03; is raised 

to a purified human A07C03; is raised to a purified mouse 
A07C03; is lmmunoselected; is a polyclonal antibody; binds 
to a denatured A07C03; exhibits a Kd to antigen of at least 
30 |1M; is attached to a solid substrate, including a bead 
20 or plastic membrane; is in a sterile composition; or is 

detectably labeled, including a radioactive or fluorescent 
label; or E02B02 protein, wherein: the protein is a primate 
protein; the binding compound is an Fv, Fab, or Fab2 
fragment; the binding compound is conjugated to another 
25 chemical moiety; or the antibody: is raised against a 

peptide sequence of a mature polypeptide of Table 3; is 
raised against a mature primate E02B02; is raised to a 
purified human E02B02; is lmmunoselected; is a polyclonal 
antibody; binds to a denatured E02BQ2; exhibits a Kd to 
30 antigen of at least 30 pM; is attached to a solid 

substrate, including a bead or plastic membrane; is in a 
sterile composition; or is detectably labeled, including a 
radioactive or fluorescent label 

Certain kit embodiments of the invention include those 
comprising such a binding compound, and: a compartment 
comprising the binding compound; and/or instructions for 
use or disposal of reagents in the kit. Preferably, the 
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kit is capable of making a qualitative or quantitative 
analysis. Other compositions are provided, e.g., those 
comprising: the binding compound, as described, and a 
carrier, wherein the carrier is: an aqueous compound, 

5 including water, saline, and/or buffer; and/or formulated 

for oral, rectal, nasal, topical, or parenteral 
administration. 

Nucleic acid embodiments include an isolated or 
recombinant nucleic acid encoding a protein or peptide or 
10 fusion protein described, wherein: the A05F12 protein or 

peptide is from a mammal, including a primate or rodent; 
the nucleic acid: encodes an antigenic peptide sequence of 
Table 1; encodes a plurality of antigenic peptide sequences 
of Table 1; exhibits at least about 80% identity to a 
15 natural cDNA encoding the segment; is an expression vector; 

further comprises an origin of replication; is from a 
natural source; comprises a detectable label; comprises 
synthetic nucleotide sequence; is less than 6 kb, 
preferably less than 3 kb; is from a mammal, including a 
20 primate or rodent; comprises a natural full length coding 

sequence; is a hybridization probe for a gene encoding the 
A05F12; or is a PCR primer, PCR product, or mutagenesis 
primer,- the A05C03 is from a mammal, including a primate or 
rodent; the nucleic acid: encodes an antigenic peptide 
25 sequence of Table 2; encodes a plurality of antigenic 

peptide sequences of Table 2; exhibits at least about 80% 
identity to a natural cDMA encoding the segment; is an 
expression vector; further comprises an origin of 
replication; is from a natural source; comprises a 
30 detectable label; comprises synthetic nucleotide sequence; 

is less than 6 kb, preferably less than 3 kb; is from a 
mammal, including a primate or rodent; comprises a natural 
full length coding sequence; is a hybridization probe for a 
gene encoding the A07C03; or is a PCR primer, PCR product, 
35 or mutagenesis primer; the E02B02 is from a mammal, 

including a primate; or the nucleic acid: encodes an 
antigenic peptide sequence of Table 3; encodes a plurality 
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of antigenic peptide sequences of Table 3; exhibits at 
least about 80% identity to a natural cDMA encoding the 
segment; is an expression vector; further comprises an 
origin of replication; is from a natural source; comprises 
5 a detectable label; comprises synthetic nucleotide 

sequence; is less than 6 kb, preferably less than 3 kb is 
from a mammal, including a primate; comprises a natural 
full length coding sequence; is a hybridization probe for a 
gene encoding the E02B02; or is a PCR primer, PCR product, 
10 or mutagenesis primer. The invention further provides a 

cell, tissue, or organ comprising such a recombinant 
nucleic acid, including where the cell is: a prokaryotic 
cell; a eukaryotic cell; a bacterial cell; a yeast cell; an 
insect cell; a mammalian cell; a mouse cell; a primate 
15 cell; or a human cell. 

Various kits include those comprising a described 
nucleic acid, and: a compartment comprising the nucleic 
acid; a compartment further comprising a primate or rodent 
A05F12 protein or polypeptide; a compartment further 
20 comprising a primate or rodent A07C03 protein or 

polypeptide; a compartment further comprising a primate 
E02B02 protein or polypeptide; and/or instructions for use 
or disposal of reagents in the kit. Preferably, the kit is 
capable of making a qualitative or quantitative analysis. 

25 Certain preferred nucleic acids include those which: 

hybridize under wash conditions of 30° C and less than 2M 
salt to SEQ ID NO: 1 or 3; hybridize under wash conditions 
of 30° C and less than 2M salt to SEQ ID NO: 5, 7, or 9; 
hybridize under wash conditions of 30° C and less than 2M 
30 salt to SEQ ID NO: 11; exhibit at least about 85% identity 

over a stretch of at least about 30 nucleotides to a 
primate or rodent A05F12; exhibit at least about 85% 
identity over a stretch of at least about 30 nucleotides to 
a primate or rodent A07C03; or exhibit at least about 85% 
identity over a stretch of at least about 30 nucleotides to 
a primate E02B02. Other preferred nucleic acids are those 
wherein: the wash conditions are at 45° C and/or 500 mM 
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salt; the v;ash conditions are at 55° C and/or 150 roM salt; 
the identity is at least 90% and/or the stretch is at least 
55 nucleotides; or the identity is at least 95% and/or the 
stretch is at least 75 nucleotides. 

Various methods are provided, e.g., of modulating 
physiology or development of a cell or tissue culture cells 
comprising contacting the cell with: a binding composition 
which binds to a primate or rodent AQ5F12; a binding 
composition, which binds to a primate or rodent A07C03; a 
binding composition, which binds to a primate E01B02; an 
antisense nucleic acid 'which blocks expression of a primate 
or rodent A05F12; an antisense nucleic acid which blocks 
expression of a primate or rodent A07C03; or an antisense 
nucleic acid which blocks expression of a primate E02B02. 
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OUTLINE 

I. General 

II . Definitions 

III. Nucleic Acids 

IV. Making Proteins 

V. Antibodies 

VI. Purified Proteins 

VII . Physical Variants 

VIII. Binding Agent:DC Protein Complexes 

IX. Uses 

X. Kits 

XI. Binding Partner Isolation 
I. General 

The present invention provides DMA sequences encoding 
mammalian proteins expressed on dendritic cells (DC). For 
a review of dendritic cells, see Steinman (1951) Annual 
Review of Immunology 9:271-296; and Banchereau and Schmitt 
(eds. 1994) Dendritic Cells in Fundamental and Clinical 
Immunology Plenum Press, NY. These proteins are designated 
dendritic cell proteins because they were initially found 
on these cells and appear to exhibit some specificity in 
their expression. 

Specific human or mouse embodiments of these proteins 
are provided below. The descriptions below are directed, 
for exemplary purposes, to human DC genes, but are likewise 
applicable to structurally, e.g., sequence, related 
embodiments from other sources or mammalian species, 
including polymorphic or individual variants. These will 
include, e.g., proteins which exhibit a relatively few 
changes in sequence, e.g., less than about 5%, and number, 
e.g., less than 20 residue substitutions, typically less 
than 15, preferably less than 10, and more preferably less 
than 5 substitutions. These will also include versions 
which are truncated from full length, as described, and 
fusion proteins containing substantial segments of these 
sequences. 
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II. Definitions 

The term "binding composition" refers to molecules 
that bind with specificity to a these DC proteins, e.g., in 
an antibody-antigen interaction, or compounds, e.g., 
proteins, which specifically associate with the respective 
protein. Typically, the association will be in a natural 
physiologically relevant protein-protein interaction, 
either covalent or non-covalent, and may include members of 
a multiprotein complex, including carrier compounds or 
dimerization partners. The molecule may be a polymer, or 
chemical reagent. A functional analog may be a protein 
with structural modifications, or may be a wholly unrelated 
molecule, e.g., which has a molecular shape which interacts 
with the appropriate interacting determinants. The 
variants may serve as agonists or antagonists of the 

protein, see, e.g., Goodman, et al. (eds. ) (1990) Goodman_& 

Gilman's: The Pharmacological Bases of Therapeutics (8th 

ed.) Pergamon Press, Tarrytown, N.Y. 

The term "binding agent:DC protein complex", as used 
herein, refers to a complex of a binding agent and the DC 
protein. Specific binding of the binding agent means that 
the binding agent has a specific binding site that 
recognizes a site on the respective DC protein. For 
example, antibodies raised to the DC protein and 
recognizing an epitope on the DC protein are capable of 
forming a binding agent:DC protein complex by specific 
binding. Typically, the formation of a binding agent:DC 
protein complex allows the measurement of DC protein in a 
mixture of other proteins and biologies. The term 
"antibody:DC protein complex" refers to a binding agent:DC 
protein complex in which the binding agent is an antibody. 
The antibody may be monoclonal, polyclonal or even an 
antigen binding fragment of an antibody. 

"Homologous" nucleic acid sequences, when compared, 
exhibit significant similarity. The standards for homology 
in nucleic acids are either measures for homology generally 
used in the art by sequence comparison and/or phylogenetic 
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relationship, or based upon hybridization conditions. 
Hybridization conditions are described in greater detail 
below. 

An "isolated" nucleic acid is a nucleic acid, e.g., an 
5 RNA, DNA, or a mixed polymer, which is substantially 

separated from other components which naturally accompany a 
native sequence, e.g., proteins and flanking genomic 
sequences from the originating species. The term embraces 
a nucleic acid sequence which has been removed from its 
10 naturally occurring environment, and includes recombinant 

or cloned DNA isolates and chemically synthesized analogs 
or analogs biologically synthesized by heterologous 
systems. A substantially pure molecule includes isolated 
forms of the molecule. An isolated nucleic acid will 
15 generally be a homogeneous composition of molecules, but 

will, in some embodiments, contain minor heterogeneity. 

This heterogeneity is typically found at the polymer ends 
or portions not critical to a desired biological function 
or activity. 

20 As used herein, the term "DC protein" shall encompass, 

when used in a protein context, a protein having amino acid 
sequences as shown in SEQ ID NO: 2, 4, 6, 8, 10, or 12, or 

a significant fragment of such a protein. It refers to a 
polypeptide which interacts with the respective DC protein 
25 specific binding components. These binding components, 

e.g., antibodies, typically bind to the DC protein with 
high affinity, e.g., at least about 100 nM, usually better 
than about 30 nM, preferably better than about 10 nM, and 
more preferably at better than about 3 nK. 

30 The term "polypeptide" or “protein" as used herein 

includes a significant fragment or segment of said DC 
protein, and encompasses a stretch of amino acid residues 
of at least about 8 amino acids, generally at least 10 
amino acids, more generally at least 12 amino acids, often 
35 at least 14 amino acids, more often at least 16 amino 

acids, typically at least 18 amino acids, more typically at 
least 20 amino acids, usually at least 22 ammo acids, 


more 



usually at least 24 amino acids, preferably at least 26 
amino acids, more preferably at least 28 amino acids, and, 
in particularly preferred embodiments, at least about 30 or 
more amino acids. 

A "recombinant" nucleic acid is defined either by its 
method of production or its structure. In reference to its 
method of production, e.g., a product made by a process, 
the process is use of recombinant nucleic acid techniques, 
e.g., involving human intervention in the nucleotide 
sequence, typically selection or production. 

Alternatively, it can be a nucleic acid made by generating 
a sequence comprising fusion of two fragments which are not 
naturally contiguous to each other, but is meant to exclude 
products of nature, e.g., naturally occurring mutants. 

Thus, for example, products made by transforming cells with 
any non-naturally occurring vector is encompassed, as are 
nucleic acids comprising sequence derived using any 
synthetic oligonucleotide process. Such is often done to 
replace a codon with a redundant codon encoding the same or 
a conservative amino acid, while typically introducing or 
removing a sequence recognition site. Alternatively, it is 
performed to join together nucleic acid segments of desired 
functions to generate a single genetic entity comprising a 
desired combination of functions not found in the commonly 
available natural forms. Restriction enzyme recognition 
sites are often the target of such artificial 
manipulations, but other site specific targets, e.g., 
promoters, DNA replication sites, regulation sequences, 
control sequences, or other useful features may be 
incorporated by design. A similar concept is intended for 
a recombinant, e.g., fusion, polypeptide. Specifically 
included are synthetic nucleic acids which, by genetic code 
redundancy, encode polypeptides similar to fragments of 
these antigens, and fusions of sequences from various 
different species variants. 

"Solubility" is reflected by sedimentation measured in 
Svedberg units, which are a measure of the sedimentation 
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velocity of a molecule under particular conditions. The 
determination of the sedimentation velocity was classically 
performed in an analytical ultracentrifuge, but is 
typically now performed in a standard ultracentrifuge. 

See, Freifelder (1982) Physical Biochemistry (2d ed.) W.H. 
Freeman & Co., San Francisco, CA; and Cantor and Schimmel 
(1980) Biophysical Chemistry parts 1-3, W.H. Freeman & Co., 
San Francisco, CA. As a crude determination, a sample 
containing a putatively soluble polypeptide is spun in a 
standard full sized ultracentrifuge at about 50K rpm for 
about 10 minutes, and soluble molecules will remain in the 
supernatant. A soluble particle or polypeptide will 
typically be less than about 30S, more typically less than 
about 15S, usually less than about 10S, more usually less 
than about 6S, and, in particular embodiments, preferably 
less than about 4S, and more preferably less than about 3S. 
Solubility of a polypeptide or fragment depends upon the 
environment and the polypeptide. Many parameters affect 
polypeptide solubility, including temperature, electrolyte 
environment, size and molecular characteristics of the 
polypeptide, and nature of the solvent. Typically, the 
temperature at which the polypeptide is used ranges from 
about 4" C to about 65’ C. Usually the temperature at use 
is greater than about 18’ C and more usually greater than 
about 22’ C. For diagnostic purposes, the temperature will 
usually be about room temperature or warmer, but less than 
the denaturation temperature of components in the assay. 

For therapeutic purposes, the temperature will usually be 
body temperature, typically about 37" C for humans, though 
under certain situations the temperature may be raised or 
lowered in situ or in vitro. 

The size and structure of the polypeptide should 
generally be in a substantially stable state, and usually 
not in a denatured state. The polypeptide may be 
associated with other polypeptides in a quaternary 
structure, e.g., to confer solubility, or associated with 
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lipids or detergents in a manner which approximates natural 
lipid bilayer interactions. 

The solvent will usually be a biologically compatible 
buffer, of a type used for preservation of biological 
activities, and will usually approximate a physiological 
solvent. Usually the solvent will have a neutral pH, 
typically between about 5 and 10, and preferably about 7.5. 
On some occasions, a detergent will be added, typically a 
mild non-denaturing one, e.g., CHS or CHAPS, or a low 
enough concentration as to avoid significant disruption of 
structural or physiological properties of the protein. 

"Substantially pure" typically means that the protein 
is isolated from other contaminating proteins, nucleic 
acids, and other biologicals derived from the original 
source organism. Purity, or "isolation" may be assayed by 
standard methods, and will ordinarily be at least about 50% 
pure, more ordinarily at least about 60% pure, generally at 
least about 70% pure, more generally at least about 80% 
pure, often at least about 85% pure, more often at least 
about 90% pure, preferably at least about 95% pure, more 
preferably at least about 98% pure, and in most preferred 
embodiments, at least 99% pure. 

"Substantial similarity" in the nucleic acid sequence 
comparison context means either that the segments, or their 
complementary 7 strands, when compared, are identical when 
optimally aligned, with appropriate nucleotide insertions 
or deletions, in at least about 50% of the nucleotides, 
generally at least 56%, more generally at least 59%, 
ordinarily at least 62%, more ordinarily at least 65%, 
often at least 68%, more often at least 71%, typically at 
least 74%, more typically at least 77%, usually at least 
80%, more usually at least about 85%, preferably at least 
about 90%, more preferably at least about 95 to 98% or 
more, and in particular embodiments, as high at about 99% 
or more of the nucleotides. Alternatively, substantial 
similarity exists when the segments will hybridize under 
selective hybridization conditions, to a strand, or its 
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complement, typically using a sequence derived from SEQ ID 
NO: 1, 3, 5, 7, 9, or 11. Typically, selective 

hybridization will occur when there is at least about 55% 
similarity over a stretch of at least about 30 nucleotides, 
5 preferably at least about 65% over a stretch of at least 

about 25 nucleotides, more preferably at least about 75%, 
and most preferably at least about 90% over about 20 
nucleotides. See, Kanehisa (1984) Nuc . Acids Res. 12:203- 

213. The length of similarity comparison, as described, 

10 may be over longer stretches, and in certain embodiments 

will be over a stretch of at least about 17 nucleotides, 
usually at least about 20 nucleotides, more usually at 
least about 24 nucleotides, typically at least about 28 
nucleotides, more typically at least about 40 nucleotides, 
15 preferably at least about 50 nucleotides, and more 

preferably at least about 75 to 100 or more nucleotides. 

"Stringent conditions", in referring to homology or 
substantial similarity in the hybridization context, will 
be stringent combined conditions of salt, temperature, 

20 organic solvents, and other parameters, typically those 

controlled in hybridization reactions. The combination of 
parameters is more important than the measure of any 
single parameter. See, e.g., Wetmur and Davidson (1968) 

Mol. Biol. 31:349-370. A nucleic acid probe which binds to 

25 a target nucleic acid under stringent conditions is 

specific for said target nucleic acid. Such a probe is 
typically more than 11 nucleotides m length, and is 
sufficiently identical or complementary to a target nucleic 
acid over the region specified by the sequence of the probe 
30 to bind the target, under stringent hybridization 

conditions. 

Counterpart DC proteins from other mammalian species 
can be cloned and isolated by cross - species hybridization 
of closely related species. See, e.g., below. Similarity 
may be relatively low between distantly related species, 
and thus hybridization of relatively closely related 
species is advisable. Alternatively, preparation of an 
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antibody preparation which exhibits less species 
specificity may be useful in expression cloning approaches. 

The phrase "specifically binds to an antibody" or 
"specifically immunoreactive with", when referring to a 
protein or peptide, refers to a binding reaction which is 
determinative of the presence of the protein in the 
presence of a heterogeneous population of proteins and 
other biological components. Thus, under designated 
immunoassay conditions, the specified antibodies bind to a 
particular protein and do not significantly bind other 
proteins present in the sample. Specitic binding to an 
antibody under such conditions may require an antibody that 
is selected for its specificity for a particular protein. 
For example, antibodies raised to the human DC protein 
immunogen with the amino acid sequence depicted in SEQ ID 
NO: 2 can be selected to obtain antibodies specifically 
immunoreactive with that DC protein and not with other 
proteins. These antibodies recognize proteins highly 
similar to the homologous human DC protein. 

Ill. Nucleic Acids 

These DC genes are specifically expressed on dendritic 
cells. The preferred embodiments, as disclosed, will be 
useful in standard procedures to isolate genes from other 
species, e.g., warm blooded animals, such as birds and 
mammals. Cross hybridization will allow isolation of 
related proteins from individuals, strains, or species. A 
number of different approaches are available successfully 
to isolate a suitable nucleic acid clone based upon the 
information provided herein. Southern blot hybridization 
studies should identify homologous genes in other species 
under appropriate hybridization conditions. 

Purified protein or defined peptides are useful for 
generating antibodies by standard methods, as described 
below. Synthetic peptides or purified protein can be 
presented to an immune system to generate polyclonal and 
monoclonal antibodies. See, e.g., Coligan (1991) Current 
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Protocols in Irmunoloax- Wiley/Greene, MY; and Harlow and 
Lane (1989) Antibodies: A laboratory Manual Cold Spring 
Harbor Press, NY, which are incorporated herein by 
reference. Alternatively, a CD protein binding composition 
can be useful as a specific binding reagent, and advantage 
can be taken of its specificity of binding, for, e.g., 
purification of a DC protein. 

The specific binding composition can be used for 
screening an expression library made from a cell line which 
expresses the respective DC protein. Many methods for 
screening are available, e.g., standard staining of surface 
expressed ligand, or by panning. Screening of 
intracellular expression can also be performed by various 
staining or immunofluorescence procedures. The binding 
compositions could be used to affinity purify or sort out 
cells expressing the ligand. 
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Table 1. Sequence encoding a human di-ubiquitin protein, containing 
two ubiquitin domains which extend from about 1 (met) to about 83 
(pro) and from about 89 (pro) to about 165 (gly) . The putative 
polypeptide sequence comprises four cysteine residues ’which ate not 
5 characteristic of a human ubiquitin domain, e.g., that of UCRP of 

Narasimhan, et al. (1996) .J. Biol. Chen. 271:324-330. Note the 
ubiquitin conserved residues 43 (lys) and 70 (lys) are present here, 
which residues have been implicated in protein binding. The terminal 
glycine doublet is also characteristic. See SEQ ID NO: 1 and 2. 

IQ This sequence was derived from an activated CDla dendritic cell 

1 ibrary. 



GGCCCCTTGT CTGCAGAG 

ATG 

GCT 

CCC 

AAT 

GCT 

TCC 

TGC 

CTC 

TGT 

GTG 

CAT 

51 







Met 

Ala 

Pro 

Asn 

Ala 

Ser 

Cys 

Leu 

Cys 

Va 1 

His 


15 






1 




5 





10 




GTC 

CGT 

TCC 

GAG 

GAA 

TGG 

nT 

TTA 

ATG 

ACC 

TTT 

GAT 

GCC 

A A C 

lCA 

TAT 

99 


Va 1 

Arg 

Ser 

Glu 

Glu 

* i b 

Asp 

Leu 

Met 

Thr 

Phe 

Asp 

Ala 

Asn 

Pro 

Tyr 





15 





2 0 





25 




20 

GAC 

AGC 

GTG 

AAA 

AAA 

ATC 


GAA 

CAT 

GTC 

CGG 

TCT 

AAG 

ACC 

AAG 

GTT 

147 


Asp 

Ser 

Val 

Lys 

Lys 

lie 

Lys 

Glu 

His 

Va 1 

Arg 

Ser 

Ly s 

Thr 

Lys 

Val 




30 





3 5 





40 





25 

CCT 

GTG 

CAG 

GAC 

CAG 

GTT 

CTT 

TTG 

CTG 

GGC 

TCC 

AAG 

ATC 

TTA 

AAG 

CCA 

195 


Pro 

Val 

Gin 

Asp 

Gin 

Val 

Leu 

Leu 

Leu 

Gly 

Ser 

Lys 

lie 

Leu 

Lys 

Pro 




45 





5 0 





55 







CGG 

AGA 

AGC 

CTC 

TCA 

TCT 

1 rt 1 

GGC 

ATT 

GAC 

AAA 

GAG 

AAG 

ACC 

ATC 

CAC 

243 

30 

Arg 

Arg 

Ser 

Leu 

Ser 

Ser 

Tyr 

Gly 

lie 

Asp 

Ly s 

Glu 

Lys 

Thr 

lie 

His 



60 





6 5 





n f| 





75 



CTT 

ACC 

CTG 

AAA 

GTG 

GTG 

AAG 

CCC 

AGT 

GAT 

GAG 

GAG 

CTG 

CCC 

TTG 

TTT 

291 


Leu 

Thr 

Leu 

Lys 

Val 

Va 1 

Lys 

Pro 

Ser 

rtSp 

Glu 

Glu 

Leu 

Pro 

Leu 

Phe 


35 





30 





85 





90 




CTT 

GTG 

GAG 

TCA 

GGT 

GAT 

GAG 

GCA 

AAG 

AGG 

CAC 

CTC 

CTC 

CAG 

GTG 

CGA 

339 


Leu 

Val 

Glu 

Ser 

Gly 

Asp 

Glu 

Ala 

Lys 

Arg 

His 

Leu 

Leu 

Gin 

Val 

Arg 






95 





10 0 





105 




40 

AGG 

TCC 

AGC 

TCA 

GTG 

GCA 

CAA 

GTG 

> ^ t 

GCA 

ATG 

ATC 

GAG 

ACT 

AAG 

ACG 

387 


Arg 

Ser 

Ser 

Ser 

Va 1 

Ala 

GI n 

Va 1 

Lys 

A 1 a 

Met 

I le 

Glu 

Thr 

Ly s 

Thr 




110 





115 





120 





45 

GGT 

ATA 

ATC 

CCT 

GAG 

ACC 

CAG 

ATT 

GTG 

ACT 

TGC 

AAT 

GGA 

AAG 

AGA 

CTG 

435 


Gly 

lie 

He 

Pro 

Glu 

Thr 

Gin 

He 

Va 1 

Thr 

Cys 

Asri 

Gly 

Lys 

Arg 

Leu 




125 





130 





135 







GAA 

GAT 

GGG 

AAG 

ATG 

ATG 

GCA 

GAT 

TAC 

GGC 

ATC 

AGA 

AAG 

GGC 

AAC 

TTA 

483 

50 

Glu 

Asp 

Gly 

Lys 

Met 

Met 

Ala 

Asp 

Tyr 

Gly 

I le 

Arg 

Lys 

Gly 

Asn 

Leu 



140 





145 





150 





155 



CTC 

TTC 

CTG 

GCA 

TCT 

TAT 

TGT 

ATT 

GGA 

GGG 

TGACCACCCT ' 

GGGGATGGGG 

533 

cr rr 

Leu 

Phe 

Leu 

Ala 

Ser 

i r a 

Tyr 

Cys 

lie 

Gly 

Gly 

i rc: 









55 ISO 165 


TGTTGGCAGG GGTCAAAAAG CTTATTTCTT TTAATCTCTT ACTCAACGAA CACATCTTCT 593 
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Table 1 (continued) 


GATGATTTCC 

AGGATGAAGT 

CATGATTTTT 

ATTC 


CAAAATTAAT 

ATATTGCCCA 

ACTAATGTAT 


GAGAATGAGm 

ACTCTATGTT 

TACTGAGACT 


TGAGTAGAGT 
TCTTTGATTC 
AGTAAATAAA 


AAGATTTGGG TGGGATGGGT 


TAACACAATT AATTAAGTGA 
TTTTTAAGGG AAAATAGAGC 


653 

713 

773 


counterpart mouse DU gene (SEQ ID NO: 3 and 4): 


TACAGAC 

ATG 

Met 

1 

GCT 

Ala 

TCT 

Ser 

GTC 

Va 1 

CGC 

Arg 

5 

ACC 

Thr 

TGT 

Cys 

GTT 

Val 

GTC 

Va 1 

CGT 

Arg 

10 

TCA 

Ser 

GAC 

Asp 

CAA 

Gin 

TGG 

Trp 

49 

CGG 

TTA 

ATG 

ACC 

TTT 

GAG 

ACC 

ACT 

GAG 

AAT 

GAC 

ArtA 

GTG 

AAG 

AAG 

AT A 

97 

Arg 

15 

Leu 

Met 

Thr 

Phe 

Glu 

20 

Thr 

Thr 

Glu 

Asn 

Asp 

25 

Ly s 

Va 1 


Lys 

lie 

30 


AAT 

GAA 

CAT 

ATT 

AGG 

TCC 

CAA 

ACC 

AAG 

GTC 

rpr'HP 

GTA 

C AG 

GAC 

CAG 

ATC 

145 

Asn 

Glu 

His 

lie 

Arg 

3 5 

Ser 

Gin 

Thr 

Lys 

Val 

4 0 

Ser 

Val 

Gin 

MSp 

Gin 

45 

lie 


CTT 

CTG 

CTA 

GAC 

TCC 

AAA 

ATC 

CTC 

AAG 

CCC 

CAT 

CGA 

AAA 

TTG 

TCA 

TCC 

193 

Leu 

Leu 

Leu 

Asp 

50 

Ser 

Lys 

lie 

Leu 

Ly s 
55 

Pro 

His 

Arg 

Lys 

Leu 

6 0 

Ser 

Ser 


TAT 

GGG 

ATT 

GAC 

AAG 

G,-iA 

ACC 

ACT 

ATC 

CAC 

CTT 

ACC 

CTG 


GTG 

GTG 

241 

Tyr 

Gly 

lie 

65 

Asp 

Lys 

Glu 

Thr 

Thr- 

70 

He 

His 

Leu 

Thr 

Leu 

75 

Ly s 

Va 1 

Va 1 


AAG 

CCC 

AGT 

GAT 

G.~vA 

GAG 

CTG 

CCC 

TTG 

TTT 

CTG 

GTG 

GAG 

TCC 

AAA 

AAC 

289 

Lys 

Pro 

30 

Ser 

Asp 

Glu 

Glu 

Leu 

35 

Pro 

Leu 

Phe 

Leu 

Va 1 

9 0 

Glu 

Ser 

Lys 

Asn 


GAG 

GGG 

CAA 

AGG 

CAC 

CTC 

CTC 

CGA 

Qrprp 

CGA 

AG A 

TCC 

AGC 

t~a 

GTG 

GCC 

337 

Glu 

95 

Gly 

Gin 

Arg 

His 

Leu 

10 0 

Leu 

Arg 

Va 1 

Ar g 

Arg 

1 0 5 

Ser 

Ser 

Ser 

Va 1 

Ala 

110 


CAG 

GTG 

AAA 

GAG 

ATG 

ATC 

GAG 

ACT 

GTG 

ACC 

TCT 

GTG 

ATC 

CCT 

AAG 

AAG 

385 

Gin 

Val 

Lys 

Glu 

Met 

115 

lie 

G 1 u 

Ser 

Va i 

Thr 

1 2 0 

Ser 

Va 1 

lie 

Pro 

Ly s 

1 2 5 

Ly s 


CAG 

GTT 

GTG 

AAT 

TGC 

AAC 

GO A 

AAG 

AAG 

CTG 

GAA 

GAT 

GGA 

AAG 

ATC 

ATG 

433 

Gin 

Val 

Va 1 

Asn 

13 0 

Cys 

as r i 

Gly 

Lys 

Lys 

135 

Leu 

Glu 

Asp 

Gly 

Lys 

140 

He 

Met 


GCT 

GAC 

TAC 

AAC 

ATC 

aAG 

AGT 

GGC 

AGT 

TTG 

CTC 

TTT 

CTG 

AC A 

ACA 

CAC 

431 

Ala 

Asp 

Tyr 

145 

Asn 

He 

Ly s 

Ser 

Gly 

150 

Ser 

Leu 

Leu 

Phe 

Leu 

155 

Thr 

Thr 

His 



55 


TGC ACT GGG GGA TG, 
Cys Thr Gly Gly 
160 


496 
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Table 1 (continued): 


comparison of human and mouse diubiquitin polypeptide sequences 


hDU 

1 

MAPNASCLCVHVRSEEWDLMTFDANPYDSVKKIKEHVRSKTKVPVQDQVL 

50 

mDU 

1 

MASVRTCV VRSDQWRLMTFETTEUDKYKKINEHIRSQTKYSYQDQIL 

* * * t t, t * * t * t * * * * * * * tt It * It It * * * * ^ * 

47 

hDU 

51 

LLGSKILKPRRSLSS YGIDKEKT1HLTLKWK PSDEELPLFLVESGDEAE 

100 

mDU 

43 

LLDSKILKPHRKLSSYGIDKETTIHLTLKVVKPSDEELPLFLVESKNEGQ 

* * ****** * ********* *********************** * 

97 

hDU 

101 

RHLLQVRRSSSVAQVKAMIETKTGIIPETQIYTCNGKRLEDGKMMADYGI 

150 

mDU 

93 

RHLLRVRRSSSVAQVKEMIESVTSVIPKKQVVNCNGKKLEDGKIMADYNI 

* * * * ******** *•* * * * * ^ * ^ * * ^.*.****.*****. ****.* 

147 

hDU 

151 

RKGNLLFLASYCIGG ,15 5 


mDU 

143 

KSGSLLFLTTHCTGG 1.52 

***** *** \ 




BATES, et al . 


5 


10 


15 


20 


25 


30 


35 


40 


45 


50 


55 


Table 2: Sequence encoding a protein related to Ig family members, 

designated A07C03, isolated from an activated CDla dendritic cell library. 
At about positions 573 and 710, various isolates have various 
insert ions/delet ions which suggest positions of intron splicing. A 
putative signal sequence may run from about -22 (met) to about -1 (val), 
and a potential transmembrane segment runs from about 132 (phe) to about 
154 (leu). Certain cysteine residues, e.g., at positions 29 and 96 are 
characteristic of Ig domains. A region similar to the J chain of a type 1 
variable chain runs from about 112 (gly) to about 119 (val). Two putative 
glycosylation sites are found in the part amino proximal to the 
transmembrane portion, with various putative phosphorylation sites in the 
carboxy proximal part. See SEQ ID NO: 5 and 6. Sequence analysis suggests 
A07C03 is a member of the Ig superfamily of receptors, and is closely 
related to the CDS family, which contain a VIJ-type fold. A mouse 
counterpart is probably encoded in the EST '.'.'55567. 


TTCCTTTCAA ATACACACCC CAACCCGCCC CGGCATACAC 




ATG GGC 


ACT GCG 


56 


AGC 

AGA 

AGC 

AAC 

ATC 

GCT 

Ser 

Arg 

Ser 

Asn 

-15 

lie 

Ala 

TGT 

GTC 

GGT 

GCT 

GTG 

GGC 

Cy s 

Val 

Gly 

1 

Ala 

Va 1 

Gly 

TAC 

CTA 

GAA 

GTG 

GAC 

TAC 

Tyr 

15 

Leu 

Glu 

Val 

Asp 

Tyr 

20 

TTC 

TCC 

GCA 

ACC 

GGA 

TGC 

Phe 

Ser 

Ala 

Thr 

Gly 

3 c, 

Cys 

CGC 

TAC 

GGT 

GCT 

C AC 

C 

Arg 

Tyr 

Gly 

Ala 

5 0 

His 

Gin 

AGT 

GAG 

GCA 

GAC 

AAG 

TTC 

Ser 

Glu 

Ala 

65 

Asp 

Lys 

Phe 

GTT 

TCC 

CTC 

ACT 

GTA 

AAC 

Val 

Ser 

Leu 

Thr 

Val 

Asn 


30 


ATC 

TGT 

GGA 

ATA 

GCA 

TTC 

lie 

95 

Cys 

Gly 

lie 

A1 a 

Phe 

1 0 0 

AC A 

GGA 

GGA 

GGG 

ACC 

ACA 

Thr 

Gly 

Gly 

Gly 

Thr 

115 

Thr 


CGC 

AT 

CTG 

CAA 

r\ C 

CT.AT 

Arg 

His 

Leu 

Gin 

Thr 

Asn 



-10 




GCC 

TGT 

ACT 

CTC 

TCT 

GTC 

Ala 

Cys 

Thr 

Leu 

Ser 

Va 1 

5 





10 

ACT 

CAT 

GAG 

GCC 

GTC 

ACC 

Thr 

His 

Glu 

rt 1 cl 

Va 1 

Thr 





o c 


CCT 

TCT 

GAG 

CAA 

C C A 

cV- .-i 

Pro 

Ser 

Glu 

Gin 

4 0 

Pr o 

Thr 

CCT 

GrtU 

AnV 

CTG 

TGC 

TTG 

Pro 

Glu 

Asn 

Leu 

Cys 

Leu 



5 5 





GTG 

AGG 

GAG 

C' C' T 

CTC 

Th i- 

'.’a 1 

A r g 

Glu 

A1 a 

Leu 


7 n 





AG A 

GTG 

ACT 

TCA 

^iaaT 

GAC 

Arg 

Va 1 

Thr 

Ser 

Asn 

Asp 

85 





9 0 

CCC 

AGT 

GTG 

CCG 

GAA 

GCG 

Pro 

Ser 

Va 1 

Pro 

Glu 

Ala 





1 0 5 


CTG 

GTG 

GTA 

rvC A 

GAA 

ATT 

Leu 

Va I 

Va 1 

Arg 

Glu 

He 


12 0 


Met 

-22 

Gly 

• Thr 

-20 

Ala 


CTC 

ATT 

CTA 

TTT 

104 

Leu 

I le 

-5 

Leu 

Phe 


ACA 

CAA 

CCG 

TGG 

152 

Thr 

Gin 

Pro 

Trp 


ATA 

AAG 

TGT 

ACC 

200 

lie 

Lys 

Cys 

Thr 

30 


TGC 

CTG 

TGG 

TTT 

248 

Cys 

Leu 

Trp 

4 5 

Phe 


GAC 

GGG 

TGC 

AAA 

296 

Asp 

Gly 

60 

Cys 

Ly s 


AAA 

GAA 

AAC 

C A~A 

344 

Ly s 

7 c; 

Glu 

Asn 

Gin 


AGT 

GC A 

ATT 

TAC 

392 

Ser 

Ala 

lie 

Tyr 


AGA 

GCT 

svAA 

CAG 

440 

Arg 

Ala 

Lys 

Gin 

110 


AAG 

CTG 

CTC 

AGC 

488 

Ly s 

Leu 

Leu 

125 

Ser 
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Table 2 

(continued) 





AAG 

GAA 

CTG 

CGG 

AGC 

TTC 

CTG 

ACA 

5 

Lys 

Glu 

Leu 

Arg 

130 

Ser 

Phe 

Leu 

Thr 


TAT 

GTG 

ACC 

GGT 

GTG 

TGC 

GTG 

GCC 

10 

Tyr 

Val 

Thr 

145 

Gly 

Val 

cys 

V a 1 

Ala 

150 


TCC 

AAC 

CCT 

CTA 

AGA 

AAG 

AAA 

GAA 


Ser 

Asn 

160 

Pro 

Leu 

Arg 

Lys 

Lys 

165 

Glu 

15 

AAG 

AGT 

GCT 

CGG 

CGT 

ATT 

TTT 

CAG 


Lys 

175 

Ser 

Ala 

Arg 

Arg 

I le 

13 0 

Phe 

Gin 


AAG 

AGA 

CAT 

GTG 

GAA 

ACA 

AAT 

CAG 

20 

Lys 

Arg 

His 

Val 

Glu 

195 

Thr 

Asn 

Gin 


TAT 

GAA 

AAC 

AGA 

AGA 

GTA 

CTT 

TCC 

25 

Tyr 

Glu 

Asn 

Arg 

210 

Arg 

7a 1 

Leu 

Ser 


TTAATTTTCA ATGAAGTCAC TGAAAATCC 


GCT 

CT*T 

GTA 

TCA 

CTG 

CTC 

TCT 

GTC 

536 

Ala 

135 

Leu 

Val 

Ser 

Leu 

Leu 

140 

Ser 

v T a 1 


TTC 

ATA 

CTC 

CTC 

TCC 

AAA 

TCA 

AAA 

534 

Phe 

lie 

Leu 

Leu 

Ser 

155 

Lys 

Ser 

Lys 


ATA 

AAA 

GAA 

GAC 

TCA 

CAA 

AAG 

AAG 

632 

He 

Lys 

Glu 

Asp 

170 

Ser 

Gin 

Lys 

Lys 


GAA 

ATT 

GCT 

CAA 

GAA 

CTA 

TAC 

CAT 

680 

Glu 

lie 

Ala 

135 

Gin 

Glu 

Leu 

Tyr 

His 

19 0 


CAri 

TCT 

GAG 

AAA 

GAT 

AAC 

AAC 

ACT 

728 

Gin 

Ser 

2 0 0 

Glu 

Lys 

Asp 

Asn 

Asn 

205 

Thr 


AAC 

Asn 

TAT 

Tyr 

GAA 

Glu 

AGG 

Arg 

CCA 

Pro 

TAG. 

AAACGTT 

777 


215 

ACTCCAGGAG CTATGGCAGT GTTAATGAAC 8 37 


30 


ATATATCATC 

GAAGGATGTC 


AGGTCTTAAA AAAAAAATAA AGGTAAACTG AAAAGACAAC 
AGAATGTAAG GAAACTATAA CTAATAGTCA TTACCAAAAT 


TGGCTACAAA 

ACTAAAACCC 


897 

957 


AACAAAATGC AACTGAAAAA TACCTTCX 


VTTTGi 




ITT rTlTTiilfT 


35 


AAAAAAAAAA 


AAAAA 


1040 


40 


45 


improved human sequence (SEQ ID NO: 7 and 3): 

TTCCTTTCAA ATACACACCC CAACCCGCCC CGGCATACAC AGAA ATG GGG ACT GCG 

Met Gly Thr Ala 
_ -j o - 2 0 


AGC 

AGA 

AGC 

AAC 

ATC 

GCT 

CGC 

CAT 

CTG 

CAA 

.-V>~ H 

AAT 

CTC 

ATT 

CTA 

TTT 

Ser 

Arg 

Ser 

Asn 

-15 

lie 

Ala 

Arg 

His 

Leu 

i n 

X w 

G1 n 

Thr 

Asn 

Leu 

lie 

. ^ 

Leu 

Phe 

TGT 

GTC 

GGT 

GCT 

GTG 

GGC 

GCC 

TGT 

ACT 

CTC 

rr* T' 
i ^ 1 

GTC 

ACA 

C^A 

CCG 

TGG 

Cys 

Val 

Gly 

1 

Ala 

Va 1 

Gly 

Ala 

5 

Cys 

Thr 

Leu 

Ser 

Va 1 

10 

Thr 

Gin 

Pro 

Trp 

TAC 

CTA 

GAA 

GTG 

GAC 

TAC 

ACT 

CAT 

GAG 

GCC 

GTC 

ACC 

ATA 

AAG 

TGT 

ACC 

Tyr 

15 

Leu 

Glu 

Va 1 

Asp 

'TV r 

2 0 

Thr 

His 

Glu 

Ala 

Val 

2 5 

Thr 

lie 

Lys 

Cys 

Thr 

30 


50 


200 
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Table 2 (continued): 


TTC 

TCC 

GCA 

ACC 

GGA 

TGC 

CCT 

TCT 

GAG 

/— r 7. 

CCA 

Phe 

Ser 

Ala 

Thr 

Gly 

Cys 

Pro 

Ser 

Glu 

Gin 

Pro 





35 





4 0 


CGC 

TAC 

GGT 

GCT 

CAC 

CAG 

CCT 

c* ■* c* 

AAC 

CTG 

TGC 

Arg 

Tyr 

Gly 

Ala 

His 

Gin 

Pro 

Glu 

Asn 

Leu 

C' ’ 

v_ J O 




50 





55 



AGT 

GAG 

GCA 

GAC 

AAG 

TTC 

ACA 

GTG 

AGG 

GAG 

GCC 

Ser 

Glu 

Ala 

Asp 

Lys 

Phe 

Thr 

Va 1 

Arg 

Glu 

Ala 



65 





70 




GTT 

TCC 

CTC 

ACT 

GTA 

AAC 

AGA 

GTG 

ACT 

TCA 

AAT 

Val 

Ser 

Leu 

Thr 

Va 1 

Asn 

Arg 

Va 1 

Thr 

Ser 

Asn 


80 





35 





ATC 

TGT 

GGA 

ATA 

GCA 

TTC 

CCC 

AGT 

GTG 

CCG 

Or^A 

lie 

Cys 

Gly 

He 

Ala 

Phe 

Pro 

3 c- T 

Va 1 

Pro 

Glu 

95 





100 





105 

ACA 

GGA 

GGA 

GGG 

ACC 

ACA 

CTG 

GTG 

GTA 

AGA 

GAA 

Thr 

Gly 

Gly 

Gly 

Thr 

Thr 

Leu 

Val 

Va 1 

Arg 

Glu 





115 





12 0 


AAG 

GAA 

CTG 

CGG 

AGC 

■ i up-—> 

CTG 

■*. p £ 

GCT 

CTT 

GTA 

Lys 

Glu 

Leu 

Arg 

Ser 

Phe 

Leu 

Thr 

Ala 

Leu 

Va 1 


130 135 


TAT 

GTG 

ACC 

GGT 

GTG 

TGC 

GTG 

GCC 

TTC 

ATA CTC 

Tyr 

Val 

Thr 

145 

Gly 

V a 1 

Cys 

Va 1 

Ala 

150 

Phe 

lie Leu 

TCC 

AAC 

CCT 

CTA 

AGA 

AAG 

AAA 

GAA 

ATA 

AAA GAA 

Ser 

Asn 

160 

Pro 

Leu 

Arg 

Lys 

Lys 

165 

Glu 

lie 

Lys Glu 



AAG 

AGT 

GCT 

CGG 

CGT 

ATT 

TTT 

CAG 

GAA 

ATT 

GCT 

40 

Lys 

175 

Ser 

Ala 

Arg 

Ar g 

lie 

13 0 

Phe 

Gin 

Glu 

lie 

Ala 

13 5 


AAG 

AGA 

CAT 

GTG 

GAA 

ACA 

AAT 

CAG 

C A A 

TCT 

GAG 

45 

Lys 

Arg 

His 

Va 1 

Glu 

195 

Thr 

Asn 

Gin 

Gin 

Ser 

200 

Glu 


50 


TAT GAA AAC AGA AGA GTA 
Tyr Glu Asn Arg Arg 7a1 
210 

TTAATTTTC A ATGAAGTCAC T' 


CTT 

TCC 

AAC 

TAT 

GAA 

Leu 

Ser 

Asn 

Tyr 

G 1 u 



215 



AAAATCC 

A AC 

TC C rt'oLmG 


^-\C 

TGC 

CTG 

TGG 

TTT 

248 

Thr 

Cys 

Leu 

Trp 

45 

Phe 


TTG 

GAC 

GGG 

TGC 

-*> " ■*! 

296 

Leu 

Asp 

Gly 

6 0 

Cys 

Ly s 


CTC 

AAA 

GAA 

AAC 

CAA 

344 

Leu 

Lys 

75 

Glu 

Asn 

Gin 


GAC 

AGT 

GCA 

ATT 

TAC 

392 

Asp 
9 0 

Ser 

Ala 

lie 

Tyr 


GCG 

AGA 

GCT 

AAA 

CAG 

440 

Ala 

Arg 

Ala 

Lys 

Gin 

110 


ATT 

AAG 

CTG 

CTC 

AGC 

438 

lie 

Lys 

Leu 

Leu 

125 

Ser 


TCA 

CTG 

CTC 

TCT 

GTC 

536 

Ser 

Leu 

Leu 

140 

Ser 

Val 


CTC 

TCC 

AAA 

TCA 

AAA 

584 

Leu 

Ser 

155 

Lys 

Ser 

Lys 


G 

TCA 

CAA 

AAG 

AAG 

632 

S 

170 

Ser 

Gin 

Lys 

Lys 


CAA 

GAA 

CTA 

TAC 

CAT 

680 

Gin 

Glu 

Leu 

Tyr 

His 

190 


AAA 

GAT 

AAC 

AAC 

ACT 

728 

Lys 

Asp 

Asn 

Asn 

205 

Thr 



AGG CCA TAGAAACGTT 777 

Arg Pro 

CTATGGCAGT GTTAATGAAC 837 


ATATATCATC AGGTCTTAAA AAAAAAATAA AGGT/ 


TGGCTACAAA 


397 
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: 9 K 


5 


10 


15 


20 


25 


30 


35 


40 


45 


50 


Table 2 (continued) : 

GAAGGATGTC AGAATGTAAG GAAACTATAA CTAATAGTCA TTACCAAAAT ACTAAAACCC 


957 


AACAAAATGC AACTGAAAAA TACCTTCCAA ATTTGCCAAG AAAAAAAATT CTATTCCAAA 1017 

CTAAAAAAAA AAAAAAAAAA AA.AAA 1042 


counterpart mouse A07C03 (SEQ ID MG: 9 
CCACGCGTCC GGGAAAAGGC GGCACATGCA CCAGCG 


AGC 

AGG 

AGG 

GGC 

CTC 

CGG 

CTA 

GGA 

ATC 

AGC 

Ser 

Arg 

Arg 

Gly 

Leu 

Arg 

Leu 

Gly 

He 

Ser 


-15 





-10 




GGT 

GTG 

GTG 

GGC 

GCC 

TGT 

ACT 

GTA 

TCT 

GTG 

Gly 

Val 

Val 

Gly 

Ala 

Cys 

Thr 

Val 

Ser 

Va 1 

1 




5 





10 

GAG 

GTG 

GAC 

TAC 

ACG 

TCT 

CAG 

ACT 

GTC 

ACC 

Glu 

Val 

Asp 

Tyr 

Thr 

Ser 

Gin 

Thr 

Va 1 

Thr 




20 





25 


AC A 

ACT 

GGA 

TGC 

CCT 

GCA 

GTG 

CAA 

CCA 

AAA 

Thr 

Thr 

Gly 

Cys 

Pro 

Ala 

Val 

Gin 

Pro 

Ly s 



35 





4 0 



GGC 

ACT 

CAC 

CAG 

CCT 

GAA 

GCT 

CTG 

TGC 

TTG 

Gly 

Thr 

His 

Gin 

Pro 

Glu 

Ala 

Leu 

Cy s 

Leu 


50 





5 5 




GCA 

GAC 

AAG 

TTC 

ACA 

GTG 

AAA 

GAA 

ACC 

CTG 

Ala 

Asp 

Lys 

Phe 

Thr 

Va 1 

Lys 

Glu 

Thr 

Leu 

65 





7 0 





CTC 

ACT 

GTT 

AAC 

AGG 

CTG 

TCT 

CCA 

AAT 


Leu 

Thr 

Val 

Asn 

Arg 

Leu 

Ser 

Pro 

Asn 

Asp 





35 





9 0 

GGA 

ATA 

GCA 

TTT 

CCC 

AAT 

GAA 

CCG 

GTA 

CCA 

Gly 

lie 

Ala 

Phe 

Pro 

Asn 

Glu 

Pro 

Va 1 

Pro 




10 0 





10 5 


GAC 

GGG 

ACT 

AC A 

CTG 

GTG 

GTA 

AC,A 

GAA 

AGA 

Asp 

Gly 

Thr 

Thr 

Leu 

Va 1 

Va 1 

Arg 

Glu 

Arg 



115 





120 



CAC 

AGT 

CTC 

CTG 

ATA 

GTG 

CTC 

TTA 

GCA 

CTG 

His 

Ser 

Leu 

Leu 

I le 

Val 

Leu 

Leu 

A1 ci 

Leu 


130 





13 5 





and 10 : 


y 'pc*' 
r\ I ^ 

GGC 

CCT 

GTG 

AGC 

y C'r' 

54 

Met 

_ o 

Gly 

Pro 

- 2 0 

V a 1 

Ser 

Thr 


CTG 

ATC 

CTT 

CTT 

CAA 

GTT 

102 

Leu 

lie 

- 5 

Leu 

Leu 

Gin 

Val 


CTA 

CAG 

CCA 

GGT 

TAC 

CTA 

150 

Leu 

Gin 

Pro 

Gly 

Tyr 

15 

Leu 


ATG 

GAG 

TGT 

ACC 

TTT 

TCT 

198 

Met 

Glu 

Cys 

Thr 

3 0 

Phe 

Ser 


AGC 

TTG 

fT'A’C 

x 

TTT 

CGC 

TGT 

246 

Ser 

Leu 

Trp 

45 

Phe 

Arg 

Cys 


GAC 

GGA 

TGC 

AGA 

AAT 

GAG 

294 

Asp 

Gly 
6 0 

Cy s 

Arg 

Asn 

Glu 


G AC 

CAG 

A 

u. 

GTC 

TCC 

342 

s P 

" r. 

G1 r. 

Asn 

A r g 

Val 

Se r 

30 


AGT 

V 1 V. M 

ATC 

TAC 

ATC 

TGT 

390 

Cer 

Ala 

lie 

Ty r 

lie 

95 

Cys 


AC A 

GCC 

AAA 

CAG 

ACT 

GGA 

438 

r Pr~, 

Ala 

Lys 

Gin 

110 

Thr 

Gly 


CTT 

TTC 

AGC 

AGG 

GAG 

GTG 

486 

Leu 

Phe 

Ser 

125 

A r g 

Glu 

Val 


CTC 

GCA 

GTC 

TAC 

GTC 

ACC 

534 

Leu 

Ala 

140 

Va 1 

Tyr 

Va 1 

Thr 




BATES 


et ai 


Table 2 (continuedj : 



GGT 

GTG 

TGT 

GTG 

ATC 

TTC 

ATA 

GTC 

CTC 

TTC 

AGA 

TCA 

AAA 

TCT 

AAC 

ACT 

582 


Gly 

Val 

cys 

Va 1 

lie 

Phe 

lie 

Va 1 

Leu 

Phe 

Arg 

Ser 

Ly s 

del 

Asn 

Thr 


5 

145 





150 





155 





16 0 



CCA 

AGA 

AGC 

AGA 

GAA 

ACC 

nno 

GAA 

GAC 

TCG 

■* *• 7\ 

AAG 


j. 

GCT 

CG A 

630 


Pro 

Arg 

Ser 

Arg 

Glu 

Thr 

Ly s 

Glu 

Asp 

Ser 

Ly s 

Ly s 

Ly s 

Ser 

Ala 

Arg 







165 





17 0 





175 



10 

CGT 

ATC 

TTC 

CAG 

GAA 

ATT 

GCT 

C A A 

GAA 

TTA 

TAC 

CAT 

AAG 

Ao A 

TAT 

GTG 

0^ 

"J 

CD 


Arg 

He 

Phe 

Gin 

Glu 

He 

Ala 

Gin 

Glu 

Leu 

, T' V 

His 

Lys 

Arg 

Tyr 

Va 1 






180 





135 





190 




15 

GAA 

AC A 

AGT 

CAT 

CAG 

CCT 

GAG 

r' ■* > 
r\t-\ 

GA.C 

GGC 

A AT 

TAT 

GAA 

AAC 

AGA 

AAA 

726 


Glu 

Thr 

Ser 

His 

Gin 

Pro 

Glu 

Gin 

rtSp 

Gly 

n s n 

Tyr 

Glu 

Asn 

Arg 

Lys 





195 





o r, t 





2 05 






GCA 

CTC 

CCC 

AGC 

CCT 

GGA 

AGA 

C'uh 

T AG, 

A TGT'. 

GCT GACTTTTTAC TTAAAi 

CCATT 

780 

20 

Ala 

Leu 

Pro 

Ser 

Pro 

Gly 

Ar g 

Pro 











210 



GACAGTGCAA 

CTCCAGAATC 

TATGGCAGTG 

TGAATGGACA 

TACAGCAATC 

CAAACAACAG 

840 

25 

CAAAGAGAGC 

TGAGGTGTAG 

CTTGAGTGGC 

t, **. /--* ryi.^ rTirT^y* 

rtrtrto I H i i ir* 

CCCAGTAGGC 

ATGAAGTCTT 

900 


r - mmrrv * 7s rric 1 
ill J. V- 

O Tv z - ' y T, 

H i. L. ri '-J s^.. nuc ca 

CATAACTCAG 

r' ^ rp ( q r ^ 

C A AG C C TG T A 

TTCCCAACAT 

960 


TGTGTAGTAG 

TATAAAAAGT 

C AGAAGTTC A 

AGGTCATCCC 

TGACTATAGG 

ATGAACCTGA 

1020 

30 

AGTCAGAGAC 

ATGTTATCTT 

,—■ oro •—' ryi 7, > ? ^ > 

a lb i H 

CACTGCCACC 

rtC H aAcj AGz^.A 

AAGGGCAGGA 

1080 


CAAGTGGGAA 

AACAGCCAGT 

CACGCCAGAA 

GGCAGAGCGG 

AAGTAACTGT 

CACGAACCAT 

1140 

35 

AATGATGGAA 

TG TG A A AAC C' 

TCAAGAAAAC 

TC A A C TG G AG 

GAC ctTTTTT 

CTAATTTTCC 

1200 


AGGAACAGTC 

TAAGGAGCCT 

C ATTTTAAAG 

AAAAACTTCA 

CCTTCAGCTT 

TTA 

1253 


\ 


40 


compan 

son of human 

and Xoouse 

protein 

sequences : 




human 

MG...TASKS 

MIARHLQTKL 

ILFCVGAVGA 

CTLSVTQPWY 

LEVDYTHEAV 



mouse 

MO; p* ?qrp qpp 

GL . P.. LG IS L 

I LLQVGT-.'GA 

C TVSVL 0PGY 

levdytsotv 

45 


human 

TIKCTFSATG 

CPSEQPTCLV: 

FRYGAHQPEN 

LCLDGCKSEA 

DKFTVREALK 


,V>. p 

mouse 

TMECTFSTTG 

C P AVQ PK S LSI, 

FRCGTHQPEA 

LCLDGCRNEA 

DKFTVKETLD 


\\ '> 

'/ H 

human 

ENQVS LTVKR 

VTSNDSAIYI 

CGIAFPS..V 

PEARAKQTGG 

GTTLWREIK 


i 

mouse 

QNRVSLTVKR 

LS PtiDSAI YI 

dGIAFPMEKV 

P..TAKQTGD 

GTTLWRE . R 

50 





\ 





human 

LLSKELRSFL 

TALVS LLSVY 

VTdVCVAFIL 

LSKSKSN.PL 

RKKEIKEDSQ 



mouse 

LFSREVHSLL 

I v LLz^LLrt j i 

VTGVCVIFIV 

LFRSI'SMTP . 

RSRETKEDS. 



human 

KKKSARRIFQ 

EIAQELYHKR 

HVETMQQGEK 

DNNTYENRRV 

LSNYERP 

55 


mouse 

KKKSARRIFQ 

eiaoelyhkr 

YVETSHOPEQ 

DGM.YEMRKA 

LPSPGRP 





BATES, et al 


Table 3: Sequence 

designated E02B02, 
protein exhibits 
family, see human 
5 hydrophobic length 

signal sequence; a 
leu333; and a serine/p 
prol84 to serl99. The 
sites, intracellular t 

10 

CGCCCGGGCA GGTAGCGGCC GCTGAATTCT 


to LAMP-like family members, 
dendritic cells. The encoded 
Membrane Protein (LAMP) 
features are a 
(ser), putatively a 
rorri about ile359 to 

a hinge from about 
osylation 


AGAACGCCCA CC ATG CCC CGG CAG 54 

Met Pro Arg Gin 
-23 -20 


encoding a protein related 
isolated from human CDla 
homology to Lysosome-Associated 
LMP1 and LMP2 and CD 68 . Notab 
from about -23 (met) to about -i 
putative transmembrane segment f 

oline rich stretch suggestive of 
sequence also exhibits presumpti 
yrosine. See SEQ ID NO: 11 and 1 


ve 
2 . 


gly ; 


15 

CTC 

AGC 

GCG 


Leu 

Ser 

Ala 


GAT 

GGC 

AGT 

20 

Asp 

Gly 

Ser 


TCT 

CAA 

CCT 

25 

Ser 

Gin 

15 

Pro 


CAG 

CAA 

CCA 


Gin 

30 

Gin 

Pro 

30 


ATG 

GAT 

GGT 


Met 

Asp 

Gly 

35 

ACA 

ACT 

ACC 


Thr 

Thr 

Thr 


ACC 

TAG 

ACC 

40 

Thr 

Tyr 

Thr 

8 0 


GCT 

CCT 

CCA 

45 

Ala 

Pro 

95 

Pro 


TCA 

CTG 

CCA 


Ser 

110 

Leu 

Pro 

50 


TCA 

TCA 

ACC 


Ser 

Ser 

Thr 


GCG 

GCC 

GCG 

CTC 

TTC 

Ala 

Ala 

Ala 

Leu 

Phe 


-15 




CAA 

ATG 

AG A 

GCA 

AAA 

Gin 

Met 

Arg 

Ala 

Lys 

1 




5 

ACT 

GCA 

GCA 

GCA 

ACA 

Thr 

Ala 

Ala 

Ala 

Thr 




2 0 


GCT 

■ 

Ad.*d vJ» 

C A A 

GCA 

CCT 

A i a 

Lys 

Gin 

A i a 

Pro 



■j 5 



CAT 

ATC 

ACC 

TTT 

CAA 

His 

He 

Thr 

Phe 

Gin 


5 0 




CCA 

GCA 

ACT 

AC 

AAA 

Pro 

Ala 

Thr 

Thr 

Lys 

65 





CTG 

GTC 

Ti r~> ■* 

Ad 

ACC 

H /Vo 

Leu 

Va 1 

Thr 

Thr 

G 1 r. 

GTT 

ACT 

G Cd .*-d 


Add— Ad 

Va 1 

Thr 

G 1 Li 

Va 1 

Thr 




1 0 0 


CCC 

ACC 

ATC 

ACC 

CCA 

Pro 

Thr 

lie 

Thr 

Pro 



115 



GTC 

AGC 

CAC 

n'v Ad 

ACT 

Va 1 

Ser 

His 

Thr 

Thr 


G^o 

TCC 

CTG 

GCC 

GTA 

Ala 

Ser 

-10 

Leu 

Ala 

Va 1 

GCA 

TTT 

CCA 

GAA 

ACC 

Ala 

Phe 

Pro 

Glu 

Thr 

10 

p'H' 

CAG 

GAC 

AT A 

AAA 

Va 1 

Gin 

Asp 

He 

c; 

Ly s 

CAC 

CAA 

ACT 

TTA 

GCA 

H is 

G i n 

Thi 

1 ", 


Ala 

ACA 

GCG 

GCC 

AlA 

GTA 

Thr 

Ala 

55 

Ala 

Thr 

Va 1 

A AC 

ACT 

GCA 

AC C. 

ACC 

Asn 

7 0 

Thr 

/d 1 Cl 

Thr 

Thr 

GCC 





-- - -- 

s ~ 0- 


A AC 

A i a 

Thr 

r r c: 

Asn 

Asn 

9 0 

LjTv 

GGC 

CCT 

AGC 

TTA 

Va 1 

Gly 

Pro 

105 

Leu 

CCA 

GCT 

CAT 

ACA 

ACT 

Pro 

Ala 

His 

12 0 

Thr 

Thr 

GGG 

A AC 

t. f—r' 

Cd v_. v_. 

ACT 

Cdbd 

G 1 y 

A s r. 

Thr 

Th r 

G 1 n 


ATT 

TTG 

CAC 

102 

lie 

Leu 
_ c 

His 


AG A 

GAT 

TAT 

150 

Arg 

Asp 

Tyr 


AAA 

CCT 

GTC 

198 

Lys 

Pro 

Va 1 


f oV— .-d 

AG A 

TTC 

246 

A 1 a 

Arg 

Phe 

45 


AAA 

ATT 

CCA 

294 

Lys 

lie 

60 

Pro 


AGC 

CCA 

ATT 

342 

Ser 

7 5 

Pro 

lie 


TCA 

CAC 

ACA 

390 

Ser 

His 

Thr 


GCC 

CCT 

TAT 

438 

Ala 

Pro 

Tyr 


GGA 

ACC 

AGT 

486 

Gly 

Thr 

Ser 

125 


CCC 

AGT 

AAC 

534 


S e r 


Pro 


Asn 


BATES 



Table 3 (continued): 



CAG 

ACC 

ACC 

CTT 

CCA 

GCA 

ACT 

TTA 

TCG 

A r-p * 
t\ i. 

GCA 

CTG 

CAC 

AAj-i 

AGC 

ACA 

582 


Gin 

Thr 

Thr 

Leu 

Pro 

Ala 

Thr 

Leu 

ber 

lie 

nld 

Leu 

His 

Lys 

Ser 

Thr 


5 




145 





150 





155 





ACC 

GGT 

CAG 

AAG 

/ ‘ ! ‘ r T' 

GTT 

CAA 

CCC 

ACC 

CAT 

GCC 

CCA 

GGA 

A ^ s-\ 

ACG 

GCA 

630 


Thr 

Gly 

Gin 

Lys 

Pro 

Vd 1 

Gin 

Pro 

Thr 

His 

Ala 

Pro 

Gly 

Thr 

Thr 

Ala 





160 





165 





17 0 





10 

GCT 

GCC 

CAC 

AAT 

ACC 

ACC 

CGC 

ACA 

GCT 

GCA 

CCT 

GCC 

TCC 

ACG 

GTT 

CCT 

678 


Ala 

Ala 

His 

Asn 

Thr 

Thr 

Arg 

Thr 

Ala 

Ala 

Pro 

Ala 

Ser 

Thr 

Va 1 

Pro 




175 





180 





135 






15 

GGG 

CCC 

ACC 

CTT 

GCA 

CCT 

CAG 

CCA 

TCG 

TCA 

GTC 

AAG 

ACT 

GGA 

ATT 

TAT 

726 


Gly 

Pro 

Thr 

Leu 

Ala 

Pro 

Gin 

Pro 

Ser 

S e r 

Va 1 

Lys 

Thr 

Gly 

lie 

Tyr 



190 





^ Ci Cl. 





2 0 0 





2 0 s 



CAG 

GTT 

CTA 

AAC 

GG A 

AGC 

AG A 

CTC 

TGT 

AT A 

AAA 

GCA 

GAG 

ATG 

GGG 

ATA 

774 

20 

Gin 

Val 

Leu 

Asn 

Gly 

Ser 

Arg 

Leu 

cys 

lie 

Ly s 

Ala 

Glu 

Met 

Gly 

He 







210 





215 





220 




CAG 

CTG 

ATT 

GTT 

CAA 

GAC 

AAG 

GAG 

TCG 

GTT 

TTT 

TCA 

CCT 

CGG 

AG A 

TAC 

822 


Gin 

Leu 

He 

Val 

Gin 


Ly s 

Glu 

Ser 

Val 

Phe 

Ser 

Pro 

Arg 

Arg 

Tyr 


25 




225 





230 





235 





TTC 

AAC 

ATC 

GAC 

CCC 


go * 

AC 3 

CAA 

GCC 

TCT 

GGG 

AAC 

r TG rp 

GGC 

ACC 

870 


Phe 

Asn 

lie 

Asp 

Pro 

Asn 

Ala 

Thr 

G i n 

Ala 

Ser 

G i y 

rtSTl 

Cys 

1 y 

Thr 





240 





245 





2 5 0 





30 

CGA 

AAA 

TCC 

AAC 

CTT 

CTG 

TTG 

AAT 

TTT 

CAG 

GGC 

GGA 

TTT 

GTG 

AAT 

CTC 

918 


Arg 

Lys 

Ser 

Asn 

Leu 

Leu 

Leu 

Asn 

Phe 

Gin 

Gly 

Gly 

Phe 

Va 1 

Asn 

Leu 




255 





2 6 0 





O £ r; 






35 

AC A 

TTT 

ACC 

AAG 

GAT 

GAA 

GAA 

TCA 

TAT 

TAT 


AGT 

GAA 

GTG 

GGA 

GCC 

966 


Thr 

Phe 

Thr 

Lys 

Asp 

Glu 

Glu 

Ser 

Tyr 

Tyr 

lie 

Ser 

Glu 

Va 1 

Gly 

Ala 



270 





275 





28 0 





285 



TAT 

TTG 

ACC 

GTC 

TCA 

/—» rp 

CCA 

GAG 


ATT 

T 1 

C A 

GGA 

ATC 

AAA 

CAT 

1014 

40 

Tyr 

Leu 

Thr 

Va 1 

Ser 

Asp 

Pro 

Glu 

Thr 

I le 

Tv r 

Gin 

Gly 

He 

Ly s 

His 







2 3 ft 





o c p 





3 00 




GCG 

GTG 

GTG 

ATG 

'prpr - ' 

c~ 'rj 

AC A 

G C A 

GTC 

GGG 

CAT 

TCC 

TTC 

hrtG 

TGC 

GTG 

1062 


Ala 

Val 

Val 

Met 

Phe 

Gin 

Thr 

Ala 

Val 

Gly 

His 

S e r 

Phe 

Ly s 

Cys 

v a 1 


45 




305 





3 10 





3 15 





AGT 

GAA 

CAG 

AGC 

CTC 

CAG 

TTG 

TCA 

GCC 

CAC 

CTG 

CAG 

GTG 

AAA 

ACA 

ACC 

1110 


Ser 

Glu 

Gin 

Ser 

Leu 

Gin 

Leu 

Ser 

i-\ 1 a 

His 

Leu 

G1 n 

Va 1 

Lys 

Thr 

Thr 





320 





325 





3 3 0 





50 

GAT 

GTC 

CAA 

CTT 

CAA 

GCC 

' T> '"T ir T* 

GAT 

TTT 

GAA 

GAT 

GA-v_ 

CAC 

TTT 

GGA 

AAT 

1158 


Asp 

Va 1 

Gin 

Leu 

Gin 

Ala 

Phe 

Asp 

Phe 

Glu 

A s p 

Asp 

His 

Phe 

Gly 

Asn 



335 340 345 



BATES 


ec a: . 



Table 3 (continued): 






GTG GAT GAG 

TGC TCG TCT GAG TAG ACA ATT GTG 

CTT CCT GTC 

• •» mrn , —• c-* 

' M A i 

12 0 6 


Val Asp Glu 

Cys Ser Ser Asp Tyr Thr lie Val 

Leu Pro Val 

lie Gly 


5 

350 

355 350 


365 



GCC ATC GTG 

GTT GGT C'TC TGC CTT ATG GGT ATG 

GGT GTC TAT 

Mi'- 

1254 


Ala lie Va1 

Val Gly Leu Cys Leu Met Gly Met 

Gly Val Tyr 

Lys lie 




3 7 0 

375 


3 30 


10 








CGC CTA AGG 

TGT CAA TCA TCT GGA TAC CAG AGA 

ATC TAATTGTTGC 

1300 


Arg Leu Arg 

Cys Gin Ser Ser Gly Tyr Gin Arg 

He 





335 

3 90 




15 

CCGGGGGGAA 


GGAATTTAGA GAACTCTTTC 

ATCCTTCCAG 

GATGGATGTT 

1360 


GGAAATTCCC 

TCAGAGTGTG 

GGTCCTTCAA ACAATGTAAA 

CC r ^ATC f f T 

CTATTCAAAT 

1420 


GAAGTGAGTC 

ATGTGTGATT 

t^ a■'- j'T't-v--agg c ~GC AC“' t 'CA 

ATTTCTAAAT 

ACTTTTTGTT 

1480 

20 








TATTTTATGA 

AAGATATAGT 

GAGCTGTTTA TTTTCTAGTT 

TCCTTTAGAA 

TATTTTAGCC 

1540 


ACTCAAAGTC 

AACATTTGAG 

ATATGTTGAA TTAAC ATAAT 

ATATGTAAAG 

TAGAATAAGC 

1600 

25 

CTTCAAATTA 

TAAACCAAGG 

GTCAATTGTA ACTAATACTA 

CTGTGTGTGC 

ATTGAAGATT 

1660 


TTATTTTAC C 

CTTGATCTTA 

AC A AAGCC'fT TGCTTTGTTA 

TCAAATGGAC 

TTTCAGTGCT 

1720 


TTTACTATCT 

GTGTTTTATG 

GTTTCATGTA ACATACATAT 

TCCTGGTGTA 

GCACTTAACT 

1780 

30 








CCTTTTCCAC 

TTTAAATTTG 

f^%rr>rjirj>rpy~* ryir-prjnr’T' r~j->ry»y~^ r -~* 

GTTTCACTCT 

TGTCACCCAG 

1840 


GCTGGAGTAC 

AGTGGCACGA 

TCTCGGCTTA TGGCAACCTC 

CGCCTCCCGG 

GTTCAAGTGA 

1900 

35 

TTCTCCTGCT 

TCAGCTTCCC 

GAGTAGCTGG GATTACAGGC 

ACACACTACC 

ACGCCTGGCT 

1960 


AATTTTTGTA 

TTTTTATTAT 

AGACGGGGTT TC ACCATGTT 

GGCCAGACTG 

GTCTTGAACT 

2020 


CTTGACCTCA 

GGTGATCCAC 

C C A C C T C AGC CTC C CAAAG T 

GCTGGGATTA 

C AGG C ATGAG 

2080 

40 








CCATTGCGCC 

CGGCCTTAAA 

rjv"■» rj^rTorT-irT-irnrr*rT-irp * ^ T\J\ 

AAGAAC AAC A 

TATCTC AGGT 

2140 


TGTCTAAGTG 

TTTTTATGTA 

-K T ,. > 

.~i.~H nni MM M*.-a.“ av_-> 

TCAGCTTATA 

TTTTTTATCT 

2200 

45 

TGATGACTCC 

TGCTCCAGAA 

TCGCTAGACT AAGAATTAGG 

TGGCTACAGA 

TGGTAGAACT 

2260 


AAACAATAAG 

C AAG«GrtCAA 

TAATAATGGC CCTTAATTAT 

TAACAAAG TG 

CCAGAGTCTA 

2320 


GGCTAAGCAC 

TTTATCTATA 

TCTCATTTCA TTCTCACAAC 

TTATAGGTGA 

ATGAGTAAAC 

2380 

50 








TGAGACTTAA 

GGGAACTGAA 

TCACTTAAAT GTCACCTGGC 

TAACTGATGG 

CAGAGCCAGA 

2440 


GCTTGAATTC 

ATGTTGGTC T 

GACATCAAGG TCTTTGGTCT 

TCTCCCTACA 

CCAAGTTACC 

2500 

55 

TACAAGAAC A 

ATGAt A-. C A . 

ty TC TGCCTG AAGGCTCACA 

CCTCATACCA 

GCATACGCTC 

2560 
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Table 3 (continued) : 





ACCTTACAGG 

GAAATGGGTT 

TATCCAGGAT 

C ATGAGACAT 


GAAAGGAGAG 

CTTTGCAGAT 

AAC AAAATAG 

CCTATCCTTA 

ATAAATCCTC 

CACTCTCTGG 

AAGGAGACTG 

AGGGGC TTTG 

TAAAACATTA 

GTCAGTTGCT 

C ATTTTTATG 

GGATTGCTTA 


AAAGATGAAG 

GCATCAAATA 

AACTCAAAGT 

ATTTTTAAAT 

TTTTTTGATA 


TTCGCTAACC 

AACTGTTCTT 

TCTTGAGTGA 

TAGCCCCATC 

TTGTGGTAAC 

TTGCTGCTTC 

TGC ACTTC AT 

ATCCATATTT 

CCTATTGTTC 

ACTTTATTCT 

GTAGAGCAGC 

CTGCC AAGAA 

TTTTATTTCT 

GCTGTTTTTT 

TTGCTGCTAA 

n t-\. /"V- J V-. 

TAAGTCAGGA 

TGTTAAC AGA 

AAAGTCCACA 

TAACCCTAGA 

ATTCTTAGTC 

AAGG A. A T 1 .-i . T 1 

TCAAGTCAGC 

CTAGAGACCA 

TGTTGACTTT 

CCTCATGTGT 

TTCCTTATGA 

C T C A G T AA'o T 

TGGCAAGGTC 

CTGACTTTAG 

TCTTAATAAA 

ACATTGAATT 

GTAGTAAAGG 

TTTTTGTAAT 

AAAAACTTAC 

TTTGGAAAAA 

AAAAAAAAAA 

AA 






2620 
2 680 
27 4 0 
2 800 
2860 
2920 
2980 
3040 
3 100 
3 160 
3 172 
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The peptide segments can also be used to produce 
appropriate oligonucleotides to screen a library to 
determine the presence of a similar gene, e.g., an 
identical or polymorphic variant, or to identify a DC. 
genetic code can be used to select appropriate 
oligonucleotides useful as probes for screening. In 
combination with polymerase chain reaction (PCR) 
techniques, synthetic oligonucleotides will be useful in 
selecting desired clones from a library. 

Complementary sequences will also be used as probes or 
primers. Based upon identification of the likely amino 
terminus, other peptides should be particularly useful, 
e.g., coupled with anchored vector or poly-A complementary 
PCR techniques or with complementary DNA of other peptides. 

Techniques for nucleic acid manipulation of genes 
encoding these DC proteins, e.g., subcloning nucleic acid 
sequences encoding polypeptides into expression vectors, 
labeling probes, DMA hybridization, and the like are 
described generally in Sainbrook, et al . (1989) Molecular 

Cloning - A Laboratory Manual (2nd ed.) Vcl. 1-3, Cold 

Spring Harbor Laboratory-', Cold Spring Harbor Press, NY, 
which is incorporated herein by reference and hereinafter 
referred to as "Sarnbrook, et al." See also, Coligan, et 
al . (1987 and periodic supplements; Current Protocols in 

Molecular Biology Greene/Wiley, Mew York, NY, referred to 
as "Coligan, et al." 

There are various methods of isolating the DNA 
sequences encoding these DC proteins. For example, DNA is 
isolated from a genomic or cDMA library-' using labeled 
oligonucleotide probes having sequences identical or 
complementary to the sequences disclosed herein. Full- 
length probes may be used, or oligonucleotide probes may be 
generated by comparison of the sequences disclosed with 
other proteins and selecting specific primers. Such probes 
can be used directly in hybridization assays to isolate DNA 
encoding DC proteins, or probes can be designed for use in 
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amplification techniques such as PCR, for the isolation of 
DNA encoding DC proteins. 

To prepare a cDNA library, mRNA is isolated from cells 
which express the DC protein. cDNA is prepared from the 
mRNA and ligated into a recombinant vector. The vector is 
transfected into a recombinant host for propagation, 
screening and cloning. Methods for making and screening 
cDNA libraries are well known. See Gubler and Hoffman 
(1983) Gene 25:263-269; Sambrook, et al.; or Coligan, et 
al. 

For a genomic library, the DNA can be extracted from 
tissue and either mechanically sheared or enzymatically 
digested to yield fragments of about 12-20 kb. The 
fragments are then separated by gradient centrifugation and 
cloned in bacteriophage lambda vectors. These vectors and 
phage are packaged in vitro, as described, e.g., in 
Sambrook, et al. or Coligan, et al. Recombinant phage are 
analyzed by plaque hybridization as described in Benton and 
Davis (1977) Science 196:180-182. Colony hybridization is 
carried out as generally described in, e.g., Grunstein, et 
al . (1975) Proc. Mat 1 . Acad. Sci. USA 72:3961-3965. 

DNA encoding a DC protein can be identified in either 
cDNA or genomic libraries by its ability to hybridize with 
the nucleic acid probes described herein, for example in 
colony or plaque hybridization experiments. The 
corresponding DMA regions are isolated by standard methods 
familiar to those of skill in the art. See Sambrook, et 
al. 

Various methods of amplifying target sequences, such 
as the polymerase chain reaction, can also be used to 
prepare DNA encoding DC proteins. Polymerase chain 
reaction (PCR) technology is used to amplify such nucleic 
acid sequences directly from mRNA, from cDNA, and from 
genomic libraries or cDNA. libraries. The isolated 
sequences encoding DC proteins may also be used as 
templates for PCR amplification. 
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In PCR techniques, oligonucleotide primers 
complementary to two 5 1 regions in the DMA region to be 
amplified are synthesized. The polymerase chain reaction 
is then carried out using the two primers. See Ir.nis, et 
5 al. (eds.) (1990) PCR Protocols: A Guide to Methods and 

AppI icat ions Academic Press, San Diego, C.A. Primers can be 
selected to amplify the entire regions encoding a selected 
full-length DC protein or to amplify smaller DMA segments 
as desired. Once such regions are PCR-amplified, they can 
10 be sequenced and oligonucleotide probes can be prepared 

from sequence obtained using standard techniques. These 
probes can then be used to isolate DMAs encoding other 
forms of the DC proteins. 

Oligonucleotides for use as probes are chemically 
15 synthesized according to the solid phase phosphoramidite 

triester method first described by Beaucage and Carruthers 
(1983) Tetrahedron Lett. 22(20) :1859-1862, or using an 
automated synthesizer, as described in Needham-VanDevanter, 
et al . (1984) Nucleic Acids Res. 12:6159-6168. 

20 Purification of oligonucleotides is performed e.g., by 

native acrylamide gel electrophoresis or by anion-exchange 
HPLC as described in Pearson and Regnier (1983) J. Chrom. 
255:137-149. The sequence of the synthetic oligonucleotide 
can be verified using the chemical degradation method of 
25 Maxam and Gilbert in Grossman, L. and Moldave (eds.) (1980) 

Methods in Enzymolonv 65:499-560 Academic Press, New York. 

A nucleic acid encoding a human protein comprising two 
ubiquitin domains was isolated and sequenced. This clone 
has been designated A05F12 and the protein is referred to 
30 here as "diubiquitin", exhibiting two ubiquitin domains. 

Its nucleotide sequence and corresponding open reading 
frame are provided in SEQ ID NO: 1 and 2, respectively. 
Counterpart mouse sequence was identified, and described in 
SEQ ID NO: 3 and 4. 

The A05F12 comprises conserved residues characteristic 
of a ubiquitin fold. See Monia, et al. (19901 
BioTechnolony 8:209-215. Proteins of this family have a 
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wide range of roles in the cell, including a non-specific 
ligation of polypeptides and protein-protein dimerization. 
In particular, with regard to the human embodiment, the 165 
amino acid polypeptid.ne could be divided into two 
ubiquitin-1ike domains, separated by 5 amino acids. The N- 
terminal domain, which is less highly conserved (about 
29%), with respect to ubiquitin also has an initial 
extension of 6 amino acids. The C-terminal domain, which 
is more highly conserved (about 36%), contains the terminal 
glycine doublet which is implicated in protein-protein 
interactions. Lysine residues, shown by mutagenesis to be 
important in ubiquitin-protein binding, are conserved in 
both domains. See Monia, et al. (1990) BioTechnology 

8:209-215. The two domains are more closely related to 
ubiquitin than to each other (about 20%), suggesting an 
evolution towards domains with different functions. The 
diubiquitm protein contains 4 cysteine residues, two in 
each domain, atypical of ubiquitin. See, e.g., Bates, et 
al. (1997) Eur. J. Immunol. 27:2471-2477, which was 
published after the priority date of the present 
application. 

A number of proteins are known to contain ubiquitin¬ 
like domains. A protein with two ubiquitin domains (Haas, 
et al. (1987) J. BioI. Chem. 262:11315-11323) known as the 

15 kD interferon induced protein is expressed in response 
to interferon treatment m all cells sensitive to this 
treatment. The 15 kD interferon induced protein has been 
shown to conjugate endogenous cellular polypeptides (Loeb 
and Haas (1992) J. Biol. Chem. 267:7806-7813), and may 
contribute to the cellular response to viral infection. 
Other proteins of interest which contain ubiquitin-like 
domains are: NEDD-8, a mouse cDNA isolated as a neural cell 
protein, but present in many tissues and cell lines (Kumar, 
et al . (19 93) Biochem. Bioohvs. Pes. Comm. 195:393-399) ; 

RAD23A/B, proteins able to complement DMA repair in 
Xeroderma pigmentosum cell lines (Masutani, et al. (1994) 

EMBO J . 13:1831-1843) ; Bat3, a gene localized in the MHC 




class III complex and having a proline-rich domain preceded 
by a ubiquitin-like domain (Banerj:, et al. (1990) Proc . 

Nat- ' 1 Acad . Sci . USA 87:2374-2378), and which may have a 
chaperone function; Monoclonal Non Specific Suppressor 
5 Factor beta (MNSF(3) secreted by mouse and human T cells, the 

product of the faul gene, which produces a predicted 
protein of 133 amino acids with a M-terminus almost 
identical to ubiquitin and a C terminus encoding the 
ribosoma1 protein S30 (Nakamura, et al. (1995) Proc. Mat'1 

10 Acad. Sci. USA 92:3463-3467), and which is cleaved into its 

ubiquitm-1ike and ribosomal parts in the cytoplasm, the 
former being secreted and responsible tor biological 
activity (Nakamura, et al. (1996) J. Immunol. 156:533-538; 

GdX, Vv’hich is also a ubiquitin-like N-terminal fusion gene 

15 of 157 amino acids (Toniolo, et al. (1988) Proc . _Nat '1 

Acad. Sci. USA 85:851-855). In many of these proteins, the 
ubiquitin-like domain is essential tor activity, and may be 
a dimerization domain for RAD23A/B, Bat3, and GdX. 

Diubiquitin has an N-terminal closest in homology to 
2-0 Rad23A/B and Bat3 and a C-terminal with greatest homology 

to GdX and NEDD-8. These homologies might suggest a role 
in dimerization for the first domain, and with the glycine 
terminal doublet, a role in polypeptice binding for the 
second domain. Genomic analysis suggests the gene is 
25 single copy, and is mapped to the HXA-F region of 

chromosome 6. 

A second human DC clone was isolated, designated 
A07C03, is a member of the Ig domain superfamily of 
proteins. This protein is referred to herein as an Ig- 
30 family member and is described m SEQ ID NO: 5 and 6. The 

sequence was verified, and is disclosed in SEQ ID NO: 7 and 
8. A mouse counterpart is described in SEQ ID NO: 9 and 
10 . 

The respective Met codons for the mouse and human sequences 
are in a region of DMA with homology to the consensus Kozack 
sequence and are positioned identically on the protein sequence; 
significant homology between amine acids is quite striking 
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downstream. The initiation methionines are correctly positioned 
with respect to the Ig-foid, leaving a short N-termmal region 
which may help to determine the specificity of the receptor. 
However, there are no stop codons immediately upstream, and only 
a short 3' UTR is seen upstream of the putative Met codon. 

Important motifs include characteristic cysteines in the 
human at about residues 29 and 96, and in mouse at residues 29 
and 96; a J chain region in human from about glyll2 to valll8 or 
glul21, and in mouse from about glyll2 to phel54; and 
characteristic intracellular tyrosine residues in human at about 
189 and 207, and in mouse at about 187, 191, and 204. 

A third DC protein clone was isolated and designated 
E02B02, which is a member of the LAMP family. It is 
described in SEQ ID NO: 11 and 12. The message is weakly 
expressed in human cord blood progenitors cultured in the 
presence of GM-CSF and TMFo. into dendritic ceils, at the 6 
day stage. In contrast, at days 12-16, when precursors 
mature into dendritic cells with typical DC morphology and 
phenotype, large amounts of message are detected. PCR 
analysis detected expression also in Lnagehans cells, but 
not in a population of basal cells containing mostly 
keratinocytes. PMA-ionomycm activated macrophages 
generated in vitro from CD34+ progenitors cultured with M- 
CSF express the message. E02B02 expression is upregulated 
after CD40L activation in monocyte-derived dendritic cells, 
as well as in CD4 -t-CDl lc + CD3 - dendritic cells isolated ex 
vivo from tonsillar germinal centers. 

This invention provides isolated DMA or fragments to 
encode a DC protein, as described. In addition, this 
invention provides isolated or recombinant DMA which 
encodes a biologically active protein or polypeptide which 
is capable of hybridising under appropriate conditions, 
e.g., high stringency, with the DMA sequences described 
herein. Said biologically active protein or polypeptide 
can be a naturally occurring form, or a recombinant protein 
or fragment, and have an amino acid sequence as disclosed 
in SEQ ID NO: 2, 4, 6, 3, 10, or 12. Preferred embodiments 
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will be full length natural isolates, e.g., from a primate 
or rodent. In glycosylated form, the proteins should 
exhibit larger sizes. Further, this invention encompasses 
the use of isolated or recombinant DMA, or fragments 
thereof, which encode proteins which are homologous to each 
respective DC protein. The isolated DMA. can have the 
respective regulatory sequences in the 5' and 3' flanks, 
e.g., promoters, enhancers, poly-A addition signals, and 
others. 


IV. Making DC Gene Products 

DN.As which encode these DC proteins or fragments 
thereof can be obtained by chemical synthesis, screening 
cDNA libraries, or by screening genomic libraries prepared 
from a wide variety of cell lines or tissue samples. 

These DMAs can be expressed m a ’wide variety of host 
cells for the synthesis of a full-length protein or 
fragments which can, e.g., be used to generate polyclonal 
or monoclonal antibodies; for binding studies; for 
construction and expression of modified molecules; and for 
structure/function studies. Each of these DC proteins or 
their fragments can be expressed in host cells that are 
transformed or transfected with appropriate expression 
vectors. These molecules can be substantially purified to 
be free of protein or cellular contaminants, other than 
those derived from the recombinant host, and therefore are 
particularly useful in pharmaceutical compositions when 
combined with a pharmaceutically acceptable carrier and/or 
diluent. The antigen, or portions thereof, may be 
expressed as fusions with other proteins. 

Expression vectors are typically self-replicating DNA 
or RNA constructs containing the desired DC gene or its 
fragments, usually operably linked to suitable genetic 
control elements that are recognized in a suitable host 
cell. These control elements are capable of effecting 

specific type of 
pression will depend 
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control elements necessary to effect 
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upon the eventual host cell used. Generally, the genetic 
control elements can include a prokaryotic promoter system 
or a eukaryotic promoter expression control system, and 
typically include a transcriptional promoter, an optional 
5 operator to control the onset of transcription, 

transcription enhancers to elevate the level of mRNA 
expression, a sequence that encodes a suitable ribosome 
binding site, and sequences that terminate transcription 
and translation. Expression vectors also usually' contain 
10 an origin of replication that allows the vector to 

replicate independently' from the host cell. 

The vectors of this invention contain DNAs which 
encode the various DC proteins, or a fragment thereof, 
typically encoding, e.g., a biologically active 
IS polypeptide, or protein. The DNA can be under the control 

of a viral promoter and can encode a selection marker. 

This invention further contemplates use of such expression 
vectors which are capable of expressing eukaryotic cDNA 
coding for a DC protein in a prokaryotic or eukaryotic 
20 host, where the vector is compatible with the host and 

where the eukaryotic cDNA coding for the protein is 
inserted into the vector such that growth of the host 
containing the vector expresses the cDNA in question. 
Usually, expression vectors are designed for stable 
25 replication in their host cells or for amplification to 

greatly increase the total number of copies of the 
desirable gene per cell. It is not a 1 way's necessary to 
require that an expression vector replicate in a host cell, 
e.g., it is possible to effect transient expression of the 
30 protein or its fragments in various hosts using vectors 

that do not contain a replication origin that is recognized 
by the host cell. It is also possible to use vectors that 
cause integration of a DC gene or its fragments into the 
host DNA by recombination, or to integrate a promoter which 
35 controls expression of an endogenous gene. 

Vectors, as used herein, comprise plasmids, viruses, 
bacteriophage, ir.tegratable DMA fragments, 


and other 
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vehicles which enable the integration of DMA fragments into 
the genome of the host. Expression vectors are specialised 
vectors which contain genetic control elements that effect 
expression of operably linked genes. Plasmids are the most 
5 commonly used form of vector but all other forms of vectors 

which serve an equivalent function are suitable for use 
herein. See, e.g., Pouwels, et al. (1585 and Supplements) 

Cloning Vectors : A Laboratory/ Manual Elsevier, N.Y.; and 
Rodriquez, et al. (eds.) (1988) Vectors: A Survey of 

10 Molecular Cloning Vectors and Their Uses Buttersworth, 

Boston, MA.. 

Suitable host cells include prokaryotes, lower 
eukaryotes, and higher eukaryotes. Prokaryotes include 
both gram negative and gram positive organisms, e.g., E. 

15 coli and B. subtilis. Lower eukaryotes include yeasts, 

e.g., S. cerevisiae and Pichia, and species of the genus 
Dictyostelium. Higher eukaryotes include established 
tissue culture cell lines from animal cells, both of 
non-mammalian origin, e.g., insect cells, and birds, and of 
20 mammalian origin, e.g., human, primates, and rodents. 

Prokaryotic host-vector systems include a wide variety 
of vectors for many different species. As used herein, E. 
coli and its vectors will be used generically to include 
equivalent vectors used in other prokaryotes. A 
25 representative vector for amplifying DMA is pBR322 or its 

derivatives. Vectors that can be used to express DC 
proteins or fragments include, but are not limited to, such 
vectors as those containing the lac promoter (p’JC-series) ; 
trp promoter (pBR3 22-trpi ; Ipp promoter (the pIN-senes) ; 

30 lambda-pP or pR promoters (pOTS); or hybrid promoters such 

as ptac (pDR540) . See Brosius, et al. (1988) "Expression 
Vectors Employing Lambda-, trp-, lac-, and Ipp-derived 
Promoters", in Rodriguez and Denhardt (eds.) Vectors: A 
Survey of Molecular Cloning Vectors and Their Uses 10:205- 
35 236 Buttersworth, Boston, MA.. 

Lower eukaryotes, e.g., yeasts and Dictyostelium, may 
with DC gene sequence containing vectors. 


be transformed 
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For purposes of this invention, the most common lower 
eukaryotic host is the baker's yeast, Saccharomyces 
cerevisiae. It will be used genericallv to represent lower 
eukaryotes although a number of other strains and species 
5 are also available. Yeast vectors typically consist of a 

replication origin (unless of the integrating type), a 
selection gene, a promoter, DMA encoding the desired 
protein or its fragments, and sequences for translation 
termination, polyadenylation, and transcription 
10 termination. Suitable expression vectors for yeast include 

such constitutive promoters as 3-phosphoglycerate kinase 
and various other glycolytic enzyme gene promoters or such 
inducible promoters as the alcohol dehydrogenase 2 promoter 
or metallothionine promoter. Suitable vectors include 
15 derivatives of the following types: seif-replicating low 

copy number (such as the YRp-series), self-replicating high 
copy number (such as the YEp-senes); integrating types 
(such as the Yip-series), or mini-chromosomes (such as the 
YCp-senes ) . 

20 Higher eukaryotic tissue culture ceils are the 

preferred host cells for expression of the DC protein. In 
principle, most any higher eukaryotic tissue culture cell 
line may be used, e.g., insect baculovirus expression 
systems, whether from an invertebrate or vertebrate source. 
25 However, mammalian cells are preferred to achieve proper 

processing, both cotranslatlonaliy and posttranslationally. 
Transformation or transfection and propagation of such 
cells is routine. Useful cell lines include HeLa cells, 
Chinese hamster ovary !CHO ! ceil lines, baby rat kidney 
30 {BRK) cell lines, insect cell lines, bird cell lines, and 

monkey (COS) cell lines. Expression vectors for such cell 
lines usually include an origin of replication, a promoter, 
a translation initiation site, RMA splice sites (e.g., if 
genomic DMA is used), a polyadenylation site, and a 
35 transcription termination site. These vectors also may 

contain a selection gene or amplification gene. Suitable 
expression vectors may be plasmids, viruses, or 



retroviruses carrying promoters derived, e.g., from such 
sources as from adenovirus, SV40, parvoviruses, vaccinia 
virus, or cytomegalovirus. Representative examples of 
suitable expression vectors include pCDMAl; pCD, see 
Okayama, et al. (1985) Mol. Cell Biol. 5:1136-1142; pMClneo 

Poly-A, see Thomas, et al. (1987) Cell 51:503-512; and a 

baculovirus vector such as pAC 373 or pAC 610. 

In certain instances, the DC proteins need not be 
glycosylated to elicit biological responses in certain 
assays. However, it will often be desirable to express a 
DC polypeptide in a system which provides a specific or 
defined glycosylation pattern. In this case, the usual 
pattern will be that provided naturally by the expression 
system. However, the pattern will be modifiable by 
exposing the polypeptide, e.g., in unglycosylated form, to 
appropriate glycosylating proteins introduced into a 
heterologous expression system. For example, a DC gene may 
be co-transformed with one or more genes encoding mammalian 
or other glycosylating enzymes. It is further understood 
that over glycosylation may be detrimental to DC protein 
biological activity, and that one of skill may perform 
routine testing to optimize the degree of giycosylation 
which confers optimal biological activity. 

A DC protein, or a fragment thereof, may be engineered 
to be phosphatidyl inositol (PI) linked to a cell membrane, 
but can be removed from membranes by treatment with a 
phosphatidyl inositol cleaving enzyme, e.g., phosphatidyl 

inositol phosphoiipase-C. This releases the antigen in a 
biologically active form, and allows purification by 
standard procedures of protein chemistry. See, e.g.. Low 
(1989) Biochem. Rioohvs. Acta 9 88:427-454; Tse, et. al . 

(1985) Science 2 3 0:1003-1008; Brunner, et al . (1991) J. 

Cell Biol. 114:1275-1283; and Coligan, et al. (eds.) (1996 
and periodic supplements) Current Pro^ols in Protein 
Science . John Wiley & Sons, New York, MY. 

Now that these DC proteins have been characterized, 
fragments or derivatives thereof car: be prepared by 
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conventional processes for synthesizing peptides. These 
include processes such as are described in Stewart and 
Young (1984) Solid Phase Peptide Synthesis Pierce Chemical 
Co., Rockford, IL; Bodanszky and Bodanszky (1984) The 
Practice of Peotide Synthesis Springer-Verlag, New York, 

NY; and Bodanszky (1984) The Principles of Peotide 
Svnthesis Springer-Verlag, New York, NY. See also 
Merrifield (1986) Science 232:341-347; and Dawson, et al. 
(1994) Science 266:776-779. For example, an azide process, 
an acid chloride process, an acid anhydride process, a 
mixed anhydride process, an active ester process 1 for 
example, p-mtropnenyl ester, N-hydroxysuccinunide ester, 
or cyanomethyl ester), a carbodiimidazole process, an 
oxidative-reductive process, or a dicyclohexyIcarbodiimide 
(DCCD)/additive process can be used. Solid phase and 
solution phase syntheses are both applicable to the 
foregoing processes . 

The prepared protein and fragments thereof can be 
isolated and purified from the reaction mixture by means of 
peptide separation, for example, by extraction, 
precipitation, electrophoresis and various forms of 
chromatography, and the like. The DC proteins of this 
invention can be obtained in Varying degrees of purity 
depending upon the desired use. Purification can be 
accomplished by use of known protein purification 
techniques or by the use of the antibodies or binding 
partners herein described, e.g., in immunoabsorbant 
affinity chromatography. This immunoabsorbant affinity 
chromatography is carried out by first, linking the 
antibodies to a solid support and contacting the linked 
antibodies 'with solubilized lysates of appropriate source 
cells, lysates of other cells expressing the protein, or 
lysates or supernatants of cells producing the proteins as 
a result of DNA techniques, see below. 

Multiple cell lines may be screened for one which 
expresses said protein at a high level compared with other 
cells. Various ceil lines, e.g., a mouse thymic stromal 
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cell line TA4, is screened and selected for its favorable 
handling properties. Natural DC cell proteins can be 
isolated from natural sources, or by expression from a 
transformed cell using an appropriate expression vector. 

5 Purification of the expressed protein is achieved by 

standard procedures, or may be combined with engineered 
means for effective purification at high efficiency from 
cell lysates or supernatants. FLAG or Hisg segments can be 

used for such purification features. 

10 

V. Antibodies 

Antibodies can be raised to the various DC proteins, 
including individual, polymorphic, allelic, strain, or 
species variants, and fragments thereof, both in their 

15 naturally occurring (full-length) forms and in their 

recombinant forms. Additionally, antibodies can be raised 
to DC proteins in either their active forms or in their 
inactive forms. Anti-idiotypic antibodies may also be 
used. 

20 a. Antibody Production 

A number of immunogens may be used to produce 
antibodies specifically reactive with these DC proteins. 
Recombinant protein is the preferred immunogen for the 
production of monoclonal or polyclonal antibodies. 

25 Naturally occurring protein may also be used either in pure 

or impure form. Synthetic peptides made using the human DC 
protein sequences described herein may also used as an 
immunogen for the production of antibodies to the DC 
protein. Recombinant protein can be expressed in 

30 eukaryotic or prokaryotic cells as described herein, and 

purified as described. The product, is then injected into 
an animal capable of producing antibodies. Either 
monoclonal or polyclonal antibodies may be generated for 
subsequent use in immunoassays to measure the protein. 

35 Methods of producing polyclonal antibodies are known 

to those of skill in the art. In brief, an immunogen, 
preferably a purified protein, is mixed with an adjuvant 
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and animals are immunized with the mixture, 
immune response to the immunogen preparation is monitored 
by taking test bleeds and determining the titer of 
reactivity to the DC protein of interest. When 
appropriately high titers of antibody to the immunogen are 
obtained, blood is collected from the animal and antisera 
are prepared. Further fractionation of the antisera to 
enrich for antibodies reactive to the protein can be done 
if desired. See, e.g., Harlow and Lane. 

Monoclonal antibodies may be obtained by various 
techniques familiar to those skilled m the art. Briefly, 
spleen cells from an animal immunized with a desired 
antigen are immortalized, commonly by fusion with a myeloma 
cell. See, e.g., Kohler and Milsteiri (1976) Fur . J. 

Immunol. 6:511-519, which is incorporated herein by 

reference. Alternative methods of immortalization include 
transformation with Epstein Barr Virus, oncogenes, or 
retroviruses, or other methods known in the art. Colonies 
arising from single immortalized cells are screened for 
production of antibodies of the desired specificity and 
affinity for the antigen, and yield of the monoclonal 
antibodies produced by such ceils may be enhanced by 
various techniques, including injection into the peritoneal 
cavity of a vertebrate host. Alternatively, one may 
isolate DMA sequences which encode a monoclonal antibody or 
a binding fragment thereof by screening a DMA library from 
human B cells according to the general protocol outlined by 
Huse, et al. (1989! Eg i en~-e 246:1275-1281. 

Antibodies, including binding fragments and single 
chain versions, against predetermined fragments of these DC 
proteins can be raised by immunization of animals with 
conjugates of the fragments -with carrier proteins as 
described above. Monoclonal antibodies are prepared from 
cells secreting the desired antibody. These antibodies can 
be screened for binding to normal or defective DC proteins, 
or screened for agonistic or antagonistic activity. These 
monoclonal antibodies will usually bind -with at least a Kd 




of about 1 mM, more usually at least about (JM, 

typically at least about 10 fiM, more typically at least 
about 30 fiM, preferably at least about 10 |lM, and more 
preferably at least about 3 pM or better. 

In some instances, it is desirable to prepare 
monoclonal antibodies from various mammalian hosts, such as 
mice, rodents, primates, humans, etc. Description of 
techniques for preparing such monoclonal antibodies may be 
found in, e.g., Stites, et al. (eds.) Basic and Clinical 
Immunology (4th ed.) Lange Medical Publications, Los Altos, 
CA, and references cited therein; Harlow and Lane (1988) 
Antibodies: A Laboratory Manual CSH Press; Coding (1986) 

Monoclonal Antibodies: Principles and Practice (2d ed.) 

Academic Press, New York, MY; and particularly in Kohler 
and Milstein (1575) Natnre 256:495-497, which discusses one 
method of generating monoclonal antiooaies. Summarised 
briefly, this method involves injecting an animal with an 
immunogen to initiate a humoral immune response. The 
animal is then sacrificed and cells taken from its spleen, 
which are then fused with myeloma cells. The result is a 
hybrid cell or "hybridoma" that is capable of reproducing 
in vitro. The population of hybndorr.as is then screened to 
isolate individual clones, each of which secretes a single 
antibody species to the immunogen. In this manner, the 
individual antibody species obtained are the products of 
immortalized and cloned single B cells from the immune 
animal generated in response to a specific site recognized 
on the immunogenic substance. 

Other suitable techniques involve selection of 
libraries of antibodies in phage or similar vectors. See, 
Huse, et al. (1989) "Generation of a Large Combinatorial 

Library^ of the Immunoglobulin Repertoire in Phage Lambda," 
Science 246:1275-1281; and Ward, et al. (1989) Mature 

341:544-546. The polypeptides and antibodies of the 
present invention may be used v.uth or without modification, 
including chimeric or humanized antibodies. Frequently, 
the polypeptides and antibodies will be labeled by joining. 
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either covalently or non-covalencly, a substance which 
provides for a detectable signal. A wide variety of labels 
and conjugation techniques are known and are reported 
extensively in both the scientific and patent literature. 

5 Suitable labels include radionuclides, enzyrr.es, substrates, 

cofactors, inhibitors, fluorescent moieties, 
chemiluminescent moieties, magnetic particles, and the 
like. Patents, teaching the use of such labels include 
U.S. Patent Nos. 3,817,837; 3,850,752; 3,539,350; 

10 3,996,345; 4,277,437; 4,275,149; and 4,366,241. Also, 

recombinant immunoglobulins may be produced. See, Cabilly, 
U.S. Patent No. 4,816,567; and Queer:, et al . (1989) Proc „ 

Nat. ' 1 Acad. Sci. USA 8 6:10029-10033. 

The antibodies of this invention can also be used for 
15 affinity chromatography in isolating each DC protein. 

Columns can be prepared where the antibodies are linked to 
a solid support, e.g., particles, such as agarose, 

SEPHADEX, or the like, where a cell lysate may be passed 
through the column, the column -washed, followed by 
20 increasing concentrations of a mile, denaturant, whereby 

purified DC protein -will be released. 

The antibodies may also be used to screen expression 
libraries for particular expression products. Usually the 
antibodies used in such a procedure will be labeled with a 
25 moiety allowing easy detection of presence of antigen by 

antibody binding. 

Antibodies to DC proteins may be used for the analysis 
or, or identification of specific cell population 
components which express the respective protein. By 
30 assaying the expression products ot cells expressing DC 

proteins it is possible to diagnose disease, e.g., immune- 
compromised conditions, DC depleted conditions, or 
overproduction of DC. 

Antibodies raised against eacn DC will also be useful 
to raise anti-idiotypic antibodies. These will be useful 
in detecting or diagnosing various immunological conditions 
related to expression of the respective antigens. 
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b. Immunoassays 

A particular protein can be measured by a variety of 
immunoassay methods. For a review of immunological and 
immunoassay procedures in general, see Stites and Terr 
(eds.) 1991 Basic and Clinical Immunology (7th ed.). 

Moreover, the immunoassays of the present invention can be 
performed in any of several configurations, which are 
reviewed extensively in Maggio (ed.) (1980) Enzyme 

Immunoassay CRC Press,Boca Raton, Florida; Tijan (1985) 
"Practice and Theory of Enzyme Immunoassays," Laboratory 7 
Techniques in Biochemistry 7 and Molecular Biology , Elsevier 
Science Publishers B.V., Amsterdam; and Harlow and Lane 
Antibodies. A Laboratory 7 Manual , supra, each of which is 
incorporated herein by reference. See also Chan (ed.) 

(1987) Immunoassay. A Practical Guide Academic Press, 

Orlando, FL; Price and Newman (eds.) (1991) Principles _and 

Practice of Immunoassays Stockton Press, NY; and Ngo (ed.) 
(1988) Non -isot opic Immunoassays Plenum Press, NY. 

Immunoassays for measurement of these DC proteins can 
be performed by a variety of methods known to those skilled 
in the art. In brief, immunoassays to measure the protein 
can be competitive or noncompetitive binding assays. In 
competitive binding assays, the sample to be analyzed 
competes with a labeled analyte for specific binding sites 
on a capture agent bound to a so*id surface. Preferably 
the capture agent, is an antibody specifically reactive with 
the DC protein produced as described above. The 
concentration of labeled analyte bound to the capture agent 
is inversely proportional to the amount of free analyte 
present in the sample. 

In a competitive binding immunoassay, the DC protein 
present in the sample competes with labeled protein for 
binding to a specific binding agent, for example, an 
antibody specifically reactive with the DC protein. The 
binding agent may be bound to a solid surface to effect 
separation of bound labeled protein from the unbound 
labeled protein. Alternately, the compefitive binding 
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assay may be conducted in liquid phase and any of a variety 
of techniques known in the art may be used to separate the 
bound labeled protein from the unbound labeled protein. 
Following separation, the amount of bound labeled protein 
5 is determined. The amount of protein present in the sample 

is inversely proportional to the amount of labeled protein 
binding. 

Alternatively, a homogenous immunoassay may be 
performed in which a separation step is not needed. In 
10 these immunoassays, the label on the protein is altered by 

the binding of the protein to its specific binding agent. 
This alteration in the labelled protein results in a 
decrease or increase in the signal emitted by' label, so 
that measurement of the label at the end of the immunoassay 
15 allows for detection or quantitation of the protein. 

These DC proteins may' also be quantitatively' 
determined by a variety of noncompetitive immunoassay 
methods. For example, a two-site, solid phase sandwich 
immunoassay may be used. In this type of assay', a binding 
20 agent for the protein, for example an antibody', is attached 

to a solid support. A second protein binding agent, which 
may also be an antibody, and which binds the protein at a 
different site, is labeled. After binding at both sites on 
the protein has occurred, the unbound labeled binding agent 
25 is removed and the amount of labeled binding agent bound to 

the solid phase is measured. The amount of labeled binding 
agent bound is directly proportional to the amount of 
protein in the sample. 

Western blot analysis can be used to determine the 
30 presence of DC proteins in a sample. Electrophoresis is 

carried out, e.g., on a tissue sample suspected of 
containing the protein. Following electrophoresis to 
separate the proteins, and transfer of the proteins to a 
suitable solid support such as a nitrocellulose filter, the 
solid support is incubated with an antibody reactive with 
the denatured protein. This antibody may be labeled, or 
alternatively may be it may be detected by subsequent 
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incubation with a second labeled antibody that binds the 
primary antibody. 

The immunoassay formats described above employ labeled 
assay components. The label can be in a variety of forms. 
The label may be coupled directly or indirectly to the 
desired component of the assay according to methods well 
known in the art. A wide variety of labels may be used. 

The component may be labeled by any one of several methods. 
Traditionally a radioactive label incorporating 3 h, 125 ]y 

35s, 14 C, or 32p i s used. Non-radioactive labels include 

ligands which bind to labeled antibodies, fluorophores, 
chemiluminescent agents, enzymes, and antibodies 'which can 
serve as specific binding pair members for a labeled 
protein. The choice of label depends on sensitivity 
required, ease of conjugation -with the compound, stability 
requirements, and available instrumentation. For a review 
of various labeling or signal producing systems which may 
be used, see U.S. Patent No. 4,391,304, which is 
incorporated herein by reference. 

Antibodies reactive with a particular protein can also 
be measured by a variety of immunoassay methods. For 
reviews of immunological and immunoassay procedures 
applicable to the measurement of antibodies by immunoassay 

techniques, see, e.g., Stites and Terr (eds.) Basic _ and 

Clinical Immunology (7th ed. ! supra; Kaggio {ed.) Encyme 
Immunoassay , supra; and Harlow and Lane Antibodies, A 
Laboratory Manual , supra. 


A variety of different immunoassay formats, separation 
techniques, and labels can be also be used similar to those 
described above for the measurement of specific proteins. 


VI. Purified DC proteins 

The human DC diubiquitin protein amino acid sequence 
is provided in SEQ ID MO: 2. Mouse sequence is provided in 
SEQ ID NO: 4. Human nucleotide and amino acid sequences 
for the Ig-family member are provided in SEQ ID NO: 5, 6, 

7, 8, 9, and 10. The LAMP family member from human, 
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designated E023Q2, is described m SEQ IB Nu: and 12. 

The peptide sequences allow preparation or peptides to 
generate antibodies to recognize such segments, and allow 
preparation of oligonucleotides which encode such 
5 sequences. Peptides have many other uses, e.g., to 

immunopurify antibodies, as agonists or antagonists of the 
natural forms, for structural studies, etc. 

VII. Physical Variants 

10 This invention also encompasses proteins or peptides 

having substantial ammo acid sequence similarity with an 
amino acid sequence of a SEQ IB MO: 2, 4, 6, 8, 10, or 12. 

Variants exhibiting substitutions, e.g., 20 or fewer, 

preferably 10 or fewer, and more preferably 5 or fewer 
15 substitutions, are also enabled. Where the substitutions 

are conservative substitutions, the variants will share 
immunogenic or antigenic similarity or cross-reactivity 
with a corresponding natural sequence protein. Natural 
variants include individual, allelic, polymorphic, strain, 
20 or species variants. 

Amino acid sequence similarity, or sequence identity, 
is determined by optimizing residue matches, if necessary, 
by introducing gaps as required. This changes when 
considering conservative substitutions as matches. 

25 Conservative substitutions typically include substitutions 

within the following groups: glycine, alanine; valine, 
isoleucine, leucine; aspartic acid, glutamic acid; 
asparagine, glutamine; serine, threonine; lysine, arginine 
and phenylalanine, tyrosine. Homologous ammo acid 
30 sequences include natural allelic and mterspecies 

variations in each respective protein sequence. Typical 
homologous proteins or peptides will have from 50-100% 
similarity (if gaps car. be introduced), to 75-100% 
similarity (if conservative substitutions are included) 

35 with the amino acid sequence of the relevant DC protein. 

Identity measures will be at least about 50%, generally at 
least 60%, more generally at least 6 5%., usually at least 
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70%, more usually at least 75%, preferably at least 80%, 
and more preferably at least 80%, ar.d m particularly 
preferred embodiments, at least 85% or more. See also 
Needleham, et ai. (1970) J. Mol. Biol. 48:443-453; Sankoff, 

et al. (1983) Time Warps, String Edits, and Macrcmolecules: 

The Theor\' and Practice of Sequence Comparison Chapter One, 
Addison-Wesley, Reading, MA; and software packages from 
IntelliGenetics, Mountain View, CA; and the University of 
Wisconsin Genetics Computer Group (GCG) , Madison, WI. 

Nucleic acids encoding the corresponding mammalian DC 
proteins will typically hybridize to SEQ ID NO 1, 3, 5, 7, 

9, or 11 under stringent conditions. For example, nucleic 
acids encoding the respective DC proteins will typically 
hybridize to the nucleic acid of SEQ ID NO 1, 3, 5, 7, 9, 

or 11 under stringent hybridization conditions, while 
providing few false positive hybridization signals. 
Generally, stringent conditions are selected to be about 
10" C lower than the thermal melting point (Tm) for the 
sequence being hybridized to at a defined ionic strength 
and pH. The Tm is the temperature (under defined ionic 
strength and pH) at which 50% of the target sequence 
hybridizes to a perfectly matched probe. Typically, 
stringent conditions will be those in which the salt 
concentration in wash is about. 0.02 molar at pH 7 and the 
temperature is at least about 50" C. Other factors may 
significantly affect the stringency of hybridization, 
including, among others, base composit.ion and size of the 
complementary strands, the presence of organic solvents 
such as formamide, and the extent of base mismatching. A 
preferred embodiment will include nucleic acids which will 
bind to disclosed sequences in 50% formamide and 20-50 mM 
NaCl at 42" C. 

.An isolated DC gene DMA can be readily modified by 
nucleotide substitutions, nucleotide deletions, nucleotide 
insertions, and inversions of nucleotide stretches. These 
modifications result in novel DMA sequences which encode 
these DC antigens, their derivatives, or proteins having 
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highly similar physiological, immunogenic, or antigenic 
activity. 

Modified sequences can be used to produce mutant 
antigens or to enhance expression. Enhanced expression may 
involve gene amplification, increased transcription, 
increased translation, and other mechanisms. Such mutant 
DC protein derivatives include predetermined or site- 
specific mutations of the respective protein or its 
fragments. "Mutant DC protein" encompasses a polypeptide 
otherwise falling within the homology definition of the DC 
protein as set forth above, but having an amino acid 
sequence which differs from that of the DC protein as found 
in nature, whether by way of deletion, substitution, or 
insertion. In particular, "site specific mutant DC 
protein" generally includes proteins having significant 
similarity with a protein having a sequence of SEQ ID NO: 

2, 4, 6, 8, 10, or 12. Generally, the variant will share 

many physicochemical and biological activities, e.g., 
antigenic or immunogenic, with those sequences, and in 
preferred embodiments contain most or all of the disclosed 
sequence. Similar concepts apply to these various DC 
proteins, particularly those found in various warm blooded 
animals, e.g., primates and mammals. 

Although site specific mutation sites are 
predetermined, mutants need not be site specific. DC 
protein mutagenesis can be conducted by making amino acid 
insertions or deletions. Substitutions, deletions, 
insertions, or any combinations may be generated to arrive 
at a final construct. Insertions include amino- or 
carboxyl- terminal fusions. Random mutagenesis can be 
conducted at a target codon and the expressed mutants can 
then be screened for the desired activity. Methods for 
making substitution mutations at predetermined sites in DNA 
having a known sequence are well known in the art, e.g., by 
Ml3 primer mutagenesis or polymerase chain reaction (PCR) 
techniques. See also, Sambrook, et al. (1989) and Ausubel, 
et al . (1987 and Supplements; . The murat ions in the DNA 
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normally should not place coding sequences out of reading 
frames and preferably will not create complementary' regions 
that could hybridize to produce secondary’ mRNA structure 
such as loops or hairpins. 

The present invention also provides recombinant 
proteins, e.g., heterologous fusion proteins using segments 
from these proteins. A heterologous fusion protein is a 
fusion of proteins or segments which are naturally not 
normally fused in the same manner. Thus, the fusion 
product of an immunoglobulin with a respective DC 
polypeptide is a continuous protein molecule having 
sequences fused in a typical peptide linkage, typically 
made as a single translation product and exhibiting 
properties derived from each source peptide. A similar 
concept applies to heterologous nucleic acid sequences. 

In addition, new constructs rnay be made from combining 
similar functional domains from other proteins. For 
example, domains or other segments may be "swapped” between 
different new fusion polypeptides or fragments, typically 
with related proteins, e.g., within the Ig family or the 
LAMP family. Preferably, intact structural domains will be 
used, e.g., intact Ig portions. See, e.g., Cunningham, et 
al . (1989) Science 243:1330-1336; and O'Dowd, et al. (1988) 

J, Biol. Chem. 263:15985-15992. Thus, new chimeric 
polypeptides exhibiting new combinations of specificities 
will result from the functional linkage of protein-binding 
specificities and other functional domains. Also, alanine 
scanning mutagenesis may be applied, preferably to residues 
which structurally are exterior to the secondary structure, 
which will avoid most of the critical residues which 
generally disrupt tertiary structure. 

"Derivatives" of these DC antigens include amino acid 
sequence mutants, giycosylation variants, and covalent or 
aggregate conjugates with other chemical moieties. 

Covalent derivatives can be prepared by linkage of 
functionalities to groups which are found in these DC 
protein amino acid side chains or sc d/r M- or C- termini, 
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by means which are well known in the art. These 
derivatives can include, without limitation, aliphatic 
esters or amides of the carboxyl terminus, or of residues 
containing carboxyl side chains, O-acyl derivatives of 
5 hydroxyl group-containing residues, and N-acyl derivatives 

of the amino terminal amino acid or amino-group containing 
residues, e.g., lysine or arginine. Acyl groups are 
selected from the group of alkyl-moieties including C3 to 
C18 normal alkyl, thereby forming alkanoyl aroyl species. 

10 Covalent attachment to carrier proteins may be important 

when immunogenic moieties are haptens. 

In particular, glvcosy lat. ion alterations are included, 
e.g., made by modifying the glycosylation patterns of a 
polypeptide during its synthesis and processing, or in 
15 further processing steps. Particularly preferred means for 

accomplishing this are by exposing the polypeptide to 
glycosylating enzymes derived from cells which normally 
provide such processing, e.g., mammalian glycosylation 
enzymes. Deglycosylation enzymes are also contemplated. 

20 Also embraced are versions of the same primary ammo acid 

sequence which have other minor modifications, including 
phosphorylated amino acid residues, e.g., phosphotyrosine, 
phosphoserine, or phosphothreonine, or other moieties, 
including ribosyl groups or cross-linking reagents. Also, 
25 proteins comprising substitutions are encompassed, which 

should retain substantial immunogenicity, to produce 
antibodies which recognize a protein of SEQ IB NO: 2, 4, 6, 

8, 10, or 12. Typically, these proteins will contain less 

than 20 residue substitutions from the disclosed sequence, 
30 more typically less than 10 substitutions, preferably less 

than 5, and more preferably less than three. 

Alternatively, proteins 'which begin and end at structural 
domains will usually retain antigenicity and cross 
immunogen i c i t.y . 

A major group of derivatives are covalent, conjugates 
of the DC proteins or fragments thereof with other proteins 
or polypeptides. These derivatives can be synthesized in 
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the use of agents known in the art tor tneir usefulness in 
cross-linking proteins through reactive side groups. 
Preferred protein derivatization sites with cross-1inking 
agents are at free amino groups, carbohydrate moieties, and 
cysteine residues. 

Fusion polypeptides between these DC proteins and 
other homologous or heterologous proteins are also 
provided. Heterologous polypeptides may be fusions between 
different surface markers, resulting in, e.g., a hybrid 
protein. Likewise, heterologous fusions may be constructed 
which would exhibit a combination of properties or 
activities of the derivative proteins. Typical examples 
are fusions of a reporter polypeptide, e.g., luciferase, 
with a segment or domain of a protein, e.g., a receptor¬ 
binding segment, so that the presence or location of the 
fused protein may be easily determined. See, e.g.. Dull, 
et al., U.S. Patent Mo. 4,859,609. Other gene fusion 
partners include bacterial f>—gulactos i dase, tipE, Protein 
A, 6-lactamase, alpha amylase, alcohol dehydrogenase, and 
yeast alpha mating factor. See, e.g., Godowski, et al. 
(1988) Science 241:812-816. 


Such polypeptides may also have amino acid residues 
which have been chemically modified by phosphorylation, 
sulfonation, biotinylation, or the addition or removal of 
other moieties, particularly those which have molecular 
shapes similar to phosphate groups. In some embodiments, 
the modifications will be useful labeling reagents, or 
serve as purification targets, e.g., affinity ligands. 

This invention also contemplates the use of 
d. 0 rivcitiv b s of t.hssc DC prorsins ocfisr rnon vciricitions in 
amino acid sequence or glycosylation. Such derivatives may 
involve covalent or aggregative association with chemical 
moieties. These derivatives generally fall into the three 
classes: (1) salts, (2) side chair, and terminal residue 
covalent modifications, and (3) adsorption complexes, for 
example with cell membranes. Such covalent or aggregative 
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derivatives are useful as immunogens, as reagents in 
immunoassays, or in purification methods such as for 
affinity purification of ligands or other binding ligands. 
For example, a DC protein antigen can be immobilized by 
5 covalent bonding to a solid support, such as cyanogen 

bromide-activated Sepharose, by methods which are well 
known in the art, or adsorbed onto polyolefin surfaces, 
with or without glutaraldehyde cross-1inking, for use in 
the assay or purification of anti-DC protein antibodies. 

10 The DC proteins can also be labeled with a detectable 

group, e.g., radioiodmated by the chloramine T procedure, 
covalently bound to rare earth chelates, or conjugated to 
another fluorescent moiety for use in diagnostic assays. 
Purification of these DC proteins may be effected by 
15 immobilized antibodies. 

Isolated DC protein genes will allow transformation of 
cells lacking expression of a corresponding DC protein, 
e.g., either species types or cells which lack 
corresponding proteins and exhibit, negative background 
20 activity. Expression of transformed genes will allow 

isolation of antigenically pure ceil lines, with defined or 
single specie variants. This approach will allow for more 
sensitive detection and discrimination of the physiological 
effects of these DC proteins. Subceilular fragments, e.g., 
25 cytoplasts or membrane fragments, can be isolated and used. 

VIII. Binding Agent:DC Protein Complexes 

A DC protein that specifically binds to or that is 
specifically immunoreactive with an antibody generated 
30 against a defined immunogen, such as an immunogen 

consisting of the amino acid sequence of SEQ ID NO: 2, 4, 

6, 8, 10, or 12, is determined in an immunoassay. The 

immunoassay uses a polyclonal antiserum which was raised to 
the protein of SEQ ID NO: 2, 4, 6, 8, 10, 12, or 

35 appropriate combination. This antiserum is selected to 

have low crossreactivity against other members of the 
related families, and any such cross reactivicy is removed 
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by immunoabsorption prior to use ir, the immunoassay. 
Immunoselection techniques may be used with the other 
members of the related family members. 

In order to produce antisera for use in an 
5 immunoassay, the protein of SEQ ID MO: 2, 4, 6, 8, 10, or 

12 is isolated as described herein. For example, 
recombinant protein may be produced in a mammalian cell 
line. An inbred strain of mice such as balb/c is immunited 
with the appropriate protein using a standard adjuvant, 

10 such as Freund's adjuvant, and a standard mouse 

immunization protocol (see Harlow and Lane, supra). 
Alternatively, a synthetic peptide derived from the 
sequences disclosed herein and conjugated to a carrier 
protein can be used an immunogen. Polyclonal sera are 
15 collected and titered against the immunogen protein in an 

immunoassay, e.g., a solid phase immunoassay with the 
immunogen immobilized on a solid support. Polyclonal 
antisera with a titer of 10^ or greater are selected and 
tested for their cross reactivity against other related 
20 proteins, using a competitive binding immunoassay such as 

the one described in Harlow and Lane, supra, at pages 57 0- 
573. Preferably two different related proteins are used in 
this determination ir. conjunction with a given DC protein. 
For example, with the Ig family protein, at least two other 
25 family members are used to absorb out shared epitopes. In 

conjunction with the LAMP family member, two other members 
of the family are used. These other family members can be 
produced as recombinant proteins and isolated using 
standard molecular biology and protein chemistry techniques 
30 as described herein. 

Immunoassays in the competitive binding format can be 
used for the crossreactivity determinations. For example, 
the protein of SEQ ID ID: 2, 4, 6, 8, 10, or 12 can be 
immobilized to a solid support. Proteins added to the 
assay compete with the binding of the antisera to the 
immobilized antigen. The ability of the above proteins to 
compete with the binding of the antisera to the immobilized 
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protein is compared to the protein of SEQ ID NO 2, 4 

10, or 12. The percent crossreac:ivity for the above 
proteins is calculated, using standard calculations. Those 
antisera with less than 10% crossreactivity with eacn of 
the proteins listed above are selected and pooled. The 
cross-reacting antibodies are then removed from the pooled 
antisera by immunoabsorpcion with the above-listed 
proteins. 

The immunoabsorbed and pooled antisera are then used 
in a competitive binding immunoassay as described above to 
compare a second protein to the immunogen protein (e.g., 
the DC protein of SEQ ID MO: 2, 4, 6, 8, 10, or 12). In 

order to make this comparison, the two proteins are each 
assayed at a wide range of concentrations and the amount of 
each protein required to inhibit 50% of the binding of the 
antisera to the immobilized protein is determined. If the 
amount of the second protein required is less than twice 
the amount of the protein of SEQ ID NO: 2 that is required, 
then the second protein is said to specifically bind to an 
antibody generated to the immunogen. 

It is understood that DC proteins are likely a family 
of homologous proteins that, comprise two or more genes . 

For a particular gene product, such as the human Ig family 
member protein, the invention encompasses not only the 
amino acid sequences disclosed herein, but also to other 
proteins that are allelic, polymorphic, non-allelic, or 
species variants. It also understood that the term "human 
DC protein" includes nonnatural mutations introduced by 
deliberate mutation using conventional recombinant 
technology-' such as single site mutation, or by' excising 
short sections of DMA encoding these proteins or splice 
variants from the gene, or by substituting or adding small 
numbers of new arnino acids. Such minor alterations must 
substantially maintain the imrnunoident ity of the original 
molecule and/or its biological activity. Thus, these 
alterations include proteins that are specifically 
immunoreactive with a designated naturally occurring 
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respective DC protein, for example, the human DC protein 
exhibiting SEQ ID NO: 8. Particular protein modifications 
considered minor would include conservative substitution of 
amino acids with similar chemical properties, as described 
above for each protein family as a whole. By aligning a 
protein optimally with the protein of SEQ ID NO 2, 4, 6, 8, 

10, or 12, and by using the conventional immunoassays 
described herein to determine immuneidentity, one can 
determine the protein compositions of the invention. 

IX. Uses 

The present invention provides reagents which will 
find use in diagnostic applications as described elsewhere 
herein, e.g., in the general description for developmental 
abnormalities, or below in the description of kits for 
diagnosis. 

DC genes, e.g., DNA or RMA may be used as a component 
in a forensic assay. For instance, the nucleotide 
sequences provided may be labeled using, e.g., -j-p or 
biotin and used to probe standard restriction fragment 
polymorphism blots, providing a measurable character to aid 
in distinguishing between individuals. Such probes may be 
used in well-known forensic techniques such as genetic 
fingerprinting. In addition, nucleotide probes made from 
DC sequences may be used in situ assays to detect 
chromosomal abnormalities. 

Antibodies and other binding agents directed towards 
DC proteins or nucleic acids may be used to purify the 
corresponding DC protein molecule. As described in the 
Examples below, antibody purification of DC proteins is 
both possible and practicable. Antibodies and other 
binding agents may also be used in a diagnostic fashion to 
determine whether DC components are present in a tissue 
sample or cell population using well-known techniques 
described herein. The ability to attach a binding agent to 
a DC protein provides a means to diagnose disorders 
associated with expression misregulation. Antibodies and 
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other DC protein binding agents may 
histological markers. 
the expression of each c 
specific tissue types. 
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antibody or nucleic acid 


:.o the respective DC protein, 
:o distinguish tissue anc 
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is possible to use the probe 
types in situ or in vitro. 

This invention also provides reagents which may 
exhibit significant therapeutic value. The DC proteins 
(naturally occurring or recombinant;, fragments thereof, 
and antibodies thereto, along with compounds identified as 
having binding affinity to the DC protein, may be useful in 
the treatment of conditions associated with abnormal 
physiology or development, including abnormal 
proliferation, e.g., cancerous conditions, or degenerative 
conditions. Abnormal proliferation, regeneration, 
degeneration, and atrophy may be modulated by appropriate 
therapeutic treatment using the compositions provided 
herein. For example, a disease or disorder associated with 
abnormal expression or abnormal signaling by a DC, e.g., as 

an antigen presenting cell, is a target for an agonist or 
antagonist of the protein. The proteins likely play a role 
in regulation or development of hematopoietic cells, e.g., 
lymphoid cells, which affect immunological responses, e.g., 
antigen presentation and the resulting effector functions. 

Other abnormal developmental conditions are known in 
cell types shown to possess DC protein mRNA by northern 
blot analysis. See Berkow fed.; Th p Merck Manual of 
Diagnosis and Therapy, Merck & Co., Rahway, NJ; and Thorn, 
et al. Harrison's Principles of Internal Medicine . McGraw- 
Hill, NY. Development a 1 or functional abnormalities, e.g., 
of the immune system, cause significant medical 
abnormalities and conditions which rr.ay be susceptible to 
prevention or treatment using compositions provided herein. 

Recombinant DC proteins or antibodies might be 
purified and then administered to a patient. These 
reagents can be combined for therapeutic use with 
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additional active or inert ingredients, e.g., m 
conventional pharmaceutically acceptance carriers or 
diluents, e.g., immunogenic adjuvants, along with 
physiologically innocuous stabilisers and excipients. In 
particular, these may be useful in a vaccine context, where 
the antigen is combined v/ith one of these therapeutic 
versions of agonists or antagonists. These combinations 
can be sterile filtered and placed into dosage forms as by 
lyophilization in dosage vials or storage in stabilized 
aqueous preparations. This invention also contemplates use 
of antibodies or binding fragments thereof, including forms 
which are not complement binding. 

Drug screening using antibodies or receptor or 
fragments thereof can identify compounds having binding 
affinity to these DC proteins, including isolation of 
associated components. Subsequent biological assays can 
then be utilized to determine if the compound has intrinsic 
stimulating activity and is therefore a blocker or 
antagonist in that it blocks the activity of the protein. 
Likewise, a compound having intrinsic stimulating activity 
might activate the cell through the protein and is thus an 
agonist in that it simulates the ceil. This invention 
further contemplates the therapeutic use of antibodies to 
the proteins as antagonists. 

The quantities of reagents necessary for effective 
therapy v/ill depend upon many different factors, including 
means of administration, target site, physiological state 
of the patient, and other medicares administered. Thus, 
treatment dosages should be titrated to optimize safety and 
efficacy. Typically, dosages used in vitro may provide 
useful guidance in the amounts useful for in situ 
administration of these reagents. Animal testing of 
effective doses for treatment of particular disorders will 
provide further predictive indication of human dosage. 
Various considerations are described, e.g., m Gilman, et 
a1. (eds. i (19 90) Go cb.m an and Gilman's: The 

Pharmacological Paces of Therapeutics (8th ed. ; Pergamon 
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Press; and (1990) Remington ' s Pharr-, a rent ical Sciences (17th 
ed.) Mack Publishing Co., Easton, PA. Methods for 
administration are discussed therein and below, e.g., for 
oral, intravenous, ir.traperitor.eal , or intramuscular 
administration, transdermal diffusion, and others. 
Pharmaceutically acceptable carriers will include water, 
saline, buffers, and other compounds described, e.g., in 
the Merck Index , Merck & Co., Rahway, NJ. Dosage ranges 
would ordinarily be expected to be in amounts lower than 1 
mM concentrations, typically less than about 10 |JM 
concentrations, usually less than about 100 nM, preferably 
less than about 10 pM (picomolar;, and most preferably less 
than about 1 £!••! ( f err.tom.ol a r ;■ , with an appropriate carrier. 
Slow release formulations, or a slow release apparatus will 
often be utilised for continuous administration. 

The DC proteins, fragments thereof, and antibodies to 
it or its fragments, antagonists, and agonists, could be 
administered directly to the host to be treated or, 
depending on the sice of the compounds, it may be desirable 
to conjugate them to carrier proteins such as ovalbumin or 
serum albumin prior to their administration. Therapeutic 
formulations may be administered in many conventional 
dosage formulations. While it is possible for the active 
ingredient to be administered alone, it. is preferable to 
present it as a pharmaceutical formulation. Formulations 
typically comprise at least one active ingredient, as 
defined above, together with one or more acceptable 
carriers thereof. Each carrier should be both 
pharmaceutically and physiologically acceptable in the 
sense of being compatible with the other ingredients and 
not injurious to the patient. Formulations include those 
suitable for oral, rectal, nasal, or parenteral (including 
subcutaneous, intramuscular, intravenous and intradermal) 
administration. The formulations may conveniently be 
presented in unit dosage form and may be prepared by any 


methods well known 
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Pharmrolna^ral Bases of Theraoeu-jcs (8th ed. ) Pergair.on 
Press; and (1990) Rpm.inaton 1 s Pharmac ent ical— Sciences (17th 
ed.) Mack Publishing Co., Easton, PA; Avis, et al. (eds.) 

( 1993 ) Pharmaceutical Dosage Forts: Parent eral Medications 
Dekker, NY; Lieberrnan, et al . (eds.' (1990) pharmaceut ica i. 

Dnsnap Fnrma■ Tablets Dekker, NY; and Lieberrnan, et al. 

(eds.) (1990) Pharmaceutical Dosaoe Forms: Di sperse Systems 
Dekker, NY. The therapy of this invention may be combined 
with or used in association with other chemotherapeutic or 
chemopreventive agents. 

Both the naturally occurring and the recombinant form 
of the DC proteins of this invention are particular la- 
useful in kits and assay methods which are capable cf 
screening compounds for binding activity to tne proteins. 
Several methods of automating assays have been developed in 
recent years so as to permit screening of tens of thousands 
of compounds in a short period. See, e.g., Fodor, et al. 
(1991) Science 251:767-773, and other descriptions of 
chemical diversity libraries, which describe means for 
testing of binding affinity by a plurality of compounds . 

The development of suitable assays can be greatly 
facilitated by the availability of large amounts of 
purified, e.g., soluble versions of, DC protein as provided 

by this invention. 

For example, antagonists can otter, be found on^e the 
protein has been structurally defined. Testing of 
potential protein analogs is now possible upon the 
development of highly automated assay methods using a 
purified surface protein. In particular, new agonists and 
antagonists will be discovered by using screening 
techniques described herein. Of particular importance are 
compounds found to have a combined binding affinity for 
multiple related ceil surface antigens, e.g., compounds 
which can serve as antagonists for species variants of a DC 
protein. 


invention is particularly 
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drug screening techniques. The advantages ct using a 
recombinant protein in screening for specific ligands 
include: (a) improved renewable source of the protein from 
a specific source; (b) potentially greater number of 
5 antigens per cell giving better signal to noise ratio in 

assays; and (c) species variant specificity (theoretically 
giving greater biological and disease specificity). 

One method of drug screening utilizes eukaryotic or 
prokaryotic host cells which are stably transformed with 
10 recombinant DNA molecules expressing a DC protein. Cells 

may be isolated which express that protein in isolation 
from any others. Such cells, either in viable or tixed 
form, can be used for standard surface protein binding 
assays. See also, Parce, et al. (198?) Science 246:243- 

15 247; and Owicki, et al. (1990) Pro-, hat '1 Acad. Sci. USA 

87:4007-4011, ’which describe sensitive methods to detect 
cellular responses. Competitive assays are particularly 
useful, where the cells (source of DC protein) are 
contacted and incubated with an antibody having known 
20 binding affinity to the antigen, such as 125 j - an tibody, and 

a test sample whose binding affinity to the binding 
composition is being measured. The bound and tree labeled 
binding compositions are then separatee, to assess the 
degree of protein binding. The amount of test compound 
25 bound is inversely proportional to the amount of labeled 

antibody binding to the known source. Many techniques can 
be used to separate bound from free reagent to assess the 
degree of binding. This separation step could typically 
involve a procedure such as adhesion to rliters followed by 
30 washing, adhesion to plastic followed by washing, or 

centrifugation of the cell membranes. Viable cells could 
also be used to screen for the effects of drugs on these DC 
protein mediated functions, e.g., antlgen presentation or 
helper function. 

Another method utilizes membranes from transformed 
eukaryotic or prokaryotic host cells as the source of a DC 
protein. These cells are stably transformed with DMA 
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"ild re prepared from the cells 


and used in binding assays such as the competitive assay 
set forth above. 

Still another approach is to use sciurilined, 
unpurified or solubilised, purified DC protein rrom 
transformed eukaryotic or prokaryotic host cells. This 
allows for a "molecular" binding assay with the advantages 
of increased specificity, the abmtv to automate, and high 
drug test throughput. 

Another technique for drug screening involves an 
approach which provides high throughput screening for 
compounds having suitable binging tir.it/ t^. the 
respective DC protein and is described in detail in Geysen, 


European Patent Application 84 


, published on 


September 13, 1984. First, large numbers of different 

small peptide test compounds are synthesised on a solid 
substrate, e.g., plastic pms or some other appropriate 
surface, see Fodor, et al . , supra. ihen all -‘ e pms are 


reacted with solubili 


unpurified or solubilised, 


purified DC protein, and washed. The next step involves 

detecting bound reagent, e.g., antibody. 

One means for determining which sites interact with 
specific other proteins is a physical structure 
determination, e.g., x-ray crysta-mgraphy or 2 dimensional 
NMR techniques. These will provide guidance as to which 
ammo acid residues form molecular contact regions. For a 
detailed description of protein structural determination, 
see, e.g., Blundell and Johnson <1^6) Prote in 
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Typically the kit will have a compartment containing either 
a defined DC peptide or gene segment or a reagent which 
recognizes one or the other, e.g., antibodies. 

A kit for determining the binding affinity of a test 
5 compound to the respective DC protein would typically 

comprise a test compound; a labeled compound, for example 
an antibody having known binding affinity for the protein; 
a source of the DC protein (naturally occurring or 
recombinant); and a means for separating bound from free 
10 labeled compound, such as a solid phase for immobilizing 

the DC protein. Once compounds are screened, those having 
suitable binding affinity to the protein can be evaluated 
in suitable biological assays, as are well known in the 
art, to determine whether they act as agonists or 
15 antagonists to regulate DC function. The availability of 

recombinant DC polypeptides also provide well defined 
standards for calibrating such assays. 

A preferred kit for determining the concentration of, 
for example, a DC protein in a sample would typically 
20 comprise a labeled compound, e.g., antibody, having known 

binding affinity for the DC protein, a source of DC protein 
(naturally occurring or recombinant: and a means for 
separating the bound from free labeled compound, for 
example, a solid phase for immobilizing the DC protein. 

25 Compartments containing reagents, and instructions, will 

normally be provided. 

Antibodies, including antigen binding fragments, 
specific for the respective DC or its fragments are useful 
in diagnostic applications to detect the presence of 
30 elevated levels of the protein and/or its fragments. Such 

diagnostic assays can employ lysates, live cells, fixed 
cells, immunofluorescence, ceil cultures, body fluids, and 
further can involve the detection of antigens in serum, or 
the like. Diagnostic assays may be homogeneous (without a 
35 separation step between free reagent and antigen-DC protein 

complex) or heterogeneous (with a separation step). 

Various commercial assays exist, such as radioimmunoassay 
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(RIA) , enzyme-1 inked immunosorbentassay (ELISA 1 , enzyme 
immunoassay (ElA), enzyme-multiplied immunoassay technique 
(EMIT), substrate-labeled fluorescent immunoassay (SLFIA), 
and the like. For example, unlabeled antibodies can be 
employed by using a second antibody which is labeled and 
which recognizes the antibody to the DC protein or to a 
particular fragment thereof. Similar assays have also been 
extensively discussed in the literature. See, e.g., Harlow 
and Lane (1988) Antibodies: A Laboratory Manual . CSH Press, 
NY; Chan (ed.) (1987) Immunoassay: A Practical Guide 

Academic Press, Orlando, FL; Price and Newman (eds.) (1991) 
Principles and Practice of Irrmum.-issav Stockton Press, NY; 
and Ngo (ed.) (1988; Nonisotonic Immunoassay Plenum Press, 

NY. In particular, the reagents may be useful tor 
diagnosing DC populations in biological samples, either to 
detect an excess cr deficiency cf DC m a sample. The 
assay may be directed to histological analysis of a biopsy, 
or evaluation of DC numbers in a blood or tissue sample. 

Anti-idiotypic antibodies may have similar use to 
diagnose presence of antibodies against a DC protein, as 
such may be diagnostic of various abnormal states. For 
example, overproduction of the DC protein may result in 
various immunological reactions which may be diagnostic of 
abnormal physiological states, particularly m 
proliferative ceil conditions such as cancer or abnormal 
dif ferentiaticn. 

Frequently, the reagents for diagnostic assays are 
supplied in kits, so as to optimize the sensitivity of the 
assay. For the subject invent i or., depending upon the 
nature of the assay, the protocol, and the label, either 
labeled or unlabeled antibody or receptor, or labeled DC 
protein is provided. This is usually in conjunction with 
other additives, such as buffers, stabilizers, materials 
necessary for signal production such as substrates for 
enzymes, and the like. Preferably, the kit will also 
contain instructions for proper use ar.d disposal of the 
contents after use. Typically the kit has compartments for 
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each useful reagent. Desirably, the reagents are provided 
as a dry lyophiiized powder, where the reagents may be 
reconstituted in an aqueous medium providing appropriate 
concentrations of reagents for performing the assay. 

Many of the aforementioned constituents of the drug 
screening and the diagnostic assays may be used without 
modification or may be modified in a variety of ways. For 
example, labeling may be achieved by covalently or non- 
covalently joining a moiety which directly or indirectly 
provides a detectable signal. In many of these assays, the 
protein, test compound, DC protein, or antibodies thereto 
can be labeled either directly or indirectly. 

Possibilities for direct labeling include label groups: 
radiolabels such as ^-^1, enzymes (U.S. Pat. No. 3,645,090) 
such as peroxidase and alkaline phosphatase, and 
fluorescent labels (U.S. Pat. No. 3,940,475) capable of 
monitoring the change in fluorescence intensity, wavelength 
shift, or fluorescence polarization. Possibilities for 
indirect labeling include biotinylation of one constituent 
followed by binding to avidin coupled to one of the above 
label groups. 

There are also numerous methods of separating the 
bound from the free protein, or alternatively the bound 
from the free test compound. The DC protein can be 
immobilized on various matrices followed by washing. 
Suitable matrices include plastic such as an ELISA plate, 
filters, and beads. Methods of immobilizing the DC protein 
to a matrix include, without limitation, direct adhesion to 
plastic, use of a capture antibody, chemical coupling, and 
biotin-avidin. The last step in this approach involves the 
precipitation of protein/antibody complex by one of several 
methods including those utilizing, e.g., an organic solvent 
such as polyethylene glycol or a salt such as ammonium 
sulfate. Other suitable separation techniques include, 
without limitation, the fluorescein antibody magnetizable 
particle method described in Rattle, et al. (1984) C1in . 

Chem. 3 0:1457-1461, and the double antibody magnetic 
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partic 1 e separation as describe^, i.. u.S- Pa^-. 

4,659,678 . 

Methods for linking proteins or their fragments to the 
various labels have been extensively reported m the 
literature and do not require detailed discussion here. 

Many of the techniques involve the use ot activated 
carboxyl groups either through the use of carbodiimide or 
active esters to form peptide bonds, the formation of 
thioethers by reaction of a mercapto group wit.n an 
activated halogen such as cnloroacety1, or an activated 
olefin such as rnaieirnide, for linkage, or the like. 

Fusion proteins will also find use in these applications. 

Another diagnostic aspect of this invention involves 
use of oligonucleotide or polynucleotide sequences taken 
from the sequence of a respective DC protein. These 
sequences can be used as probes for detecting levels of the 
message in samples from, patients suspected or having an 
abnormal condition, e.g., cancer or immune problem. The 
preparation of both RHA and DMA nucleotide sequences, the 
labeling of the sequences, and the preferred size of the 
sequences has received ample description and discussion m 
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binding to avidin or antibodies, which may be labeled with 
a wide variety of labels, such as radionuclides, 
fluorophores, enzymes, or the like. Alternatively, 
antibodies may be employed which car. recognize specific 
duplexes, including DMA duplexes, RNA duplexes, DNA-RNA 
hybrid duplexes, or DMA-protein duplexes. The antibodies 
in turn may be labeled and the assay carried out where the 
duplex is bound to a surface, so that upon the formation of 
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duplex on the surface, the presence of antibody bouna to 
the duplex can be detected. The use of probes to the novel 


anti -sense R.NA may b t 


rarrie; 
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nventional 


techniques such as nucleic acid hybridisation, plus and 
minus screening, recombinational probing, hybrid re-eased 
translation (HRT) , and hybrid arrested translation (h-.kT: . 
This also includes amplification techniques sum as 

polymerase chain reaction iPtRi. 

Diagnostic kits which also test for the qualitative or 
quantitative presence of other markers are also 
contemplated. Diagnosis or prognosis may depend on the 
combination of multiple indications used as markers. Thus, 
kits may test for combinations of Mixers. tee, e.g., 

Vial let, et al . (1989) Progress in Growth Factor Res ^, 1:89- 

97 . 
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XI. Binding Partner Isolation 

Having isolated one member of a binding partner of a 
specific interaction, methods exist tor isolating the 
counter-partner. See, Gearing, et a-. ( it 8.) u, • B '- 1 J • 

8:3667-3676. For example, means to label a DC surface 
protein without interfering with the binding to its 
receptor can be determined. For example, an affinity label 
can be fused to either the ammo- or carboxyl-terminus of 
the ligand. An expression IiDiary can be screened fo- 
specific binding to the DC protein, e.g., by cell sorting, 
or other screening to detect subpop-bat ions which express 

such a Dinning component . me, e, «... - 

, , , , r K .r ari.r-xt- 1 !' 1 "’ alternatively, 

Proc . Nat 1 1 ecad . rci . _bs - J *-- - - x ~ - 

a panning method may be used. See, e.g., jeeu a,.d . .--.-fo 
(1987) Proc. Nat 1 1 Acacb Sci. USA 84:3365-3369. A two- 

hybrid selection system may also be applied making 
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EXAMPLES 

I. General Methods 

Many of the standard methods below are described or 
5 referenced, e.g., in Maniatis, et al . (1382! Molecular 

Cloning. A Laboratory Manual Cold Spring Harbor Laboratory, 
Cold Spring Harbor Press, NY,- Sambrook, et al . (1389) 

Molecular Cloning: A Laboratory Manual (2d ed. ) Vols. 1-3, 

CSH Press, NY; Ausubel, et al., Biolpay Greene Publishing 
10 Associates, Brooklyn, NY; or Ausubel, et al. (1987 and 

Supplements) Current Protocols in Molecular Biology 
Wiley/Greene, NY; Innis, et al. (eds.) (1990) PCR 

Protocols: A Guide to Methods and Aon.icat ions Academic 
Press, NY. 

15 Methods for protein purification include such methods 

as ammonium sulfate precipitation, column chromatography, 
electrophoresis, centrifugation, crystallization, and 
others. See, e.g., Ausubel, et al. (1987 and periodic 

supplements); Deutscher (1990) "Guide to Protein 
20 Purification," Met hods in Snzvmo1oov voi. 182, and other 

volumes in this series; Coligan, et al. (1936 and periodic 

Supplements) Current Protocols in Protein Science 
Wiley/Greene, NY; and manufacturer's literature on use of 
protein purification products, e.g., Pharmacia, Piscataway, 
25 NJ, or Bio-Rad, Richmond, CA. Combination with recombinant 

techniques allow fu sion to cipTo it ooiao .0 segments , e.g., to a 

FLAG sequence or an equivalent which can be fused via a 
protease-removable sequence. See, e.g., Hochuii (1989) 
Chemi sche Industrie 12 : 6 9-7 0 ,- Hoc: iu 1 1 (199 0 ) " Pu r i f i ca t. i on 

30 of Recombinant Proteins with Metal ('delate Absorbent" in 

Setlow (ed.) Generic Engineering. Principle and Methods 
12:87-98, Plenum Press, MY; and Crowe, et al. (1992) 
OIAexpress : The Hi oh. Level Expressi—n & Protein 
Pu n fication System QUIAGEN, Inc., Chatsworth, CA. 

Standard immunelogical techniques are described, 
in Hert.zenberq et a 1 . (eds. I 996 Wei r ' s Handbook of 
Experiment al Immunology vol?. 1-4, Blackwell Science; 
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Coligan (1991) Current Protocols in Immunology 
Wiley/Greene, NY; and Methods in Er. memo logy volumes 70, 73, 

74, 84, 92, 93, 108, 116, 121, 132, 150, 162, and 163. 

Methods for determining immunological function are 
5 described, e.g., in Coligan, et al. (1992 and periodic 

Supplements) Current Protocols in Immunology V;i ley/Greene, 
NY. See also, e.g., Paul (ed.) (1993) Fundament a 1 

Immunology (3d ed.) Raven Press, N.Y. 

FACS analyses are described in Melamed, et al. (1990) 

10 Flow Cvtometrv and Sorting Wiley-Liss, Inc., New York, NY; 

Shapiro (1988) Practical Flow Cytometry Liss, New York, NY; 
and Robinson, et aI. ' 1 9 9 3' Handbook of Flow Cytometry 

Met hods Wiley-Liss, Mew Yo r k, NY. 

15 II. Generation of dendritic cells 

Human CD34+ cells were obtained as follows. See, 
e.g., Caux, et al. (1995) pages 1-5 in Banchereau and 

Schmitt Dendritic Cells in Fundamental and Clinical 
Immunology Plenum Press, NY. Peripheral or cord blood 
20 cells, sometimes CD34 + selected, were cultured m the 

presence of Stem Cell Factor (SCF) , GM-CSF, and TNF-cx in 
endotoxin free RPMI 1640 medium (GIECO, Grand Island, NY) 
supplemented with 10% (v/v) heat-inactivated fetal bovine 

serum (FBS; Flow Laboratories, Irvine, CA) , 10 rnM HEPES, 2 

25 mM L-glutamine, 5 X 10~ 5 M 2-mercaptoethanoi, penicillin 

(100 (jg/ml) . This is referred to as complete medium. 

CD34- cells were seeded for expansion in 25 to 75 cm 2 
flasks (Corning, MY) at 2 x IN cells .mi. Optimal 
conditions were maintained by splitting these cultures at 
30 day 5.and 10 with medium containing fresh GM-CSF and TNF-a 

(cell concentration: 1-3 x 10 5 cells/ml). In certain 
cases, cells were FACS sorted for CDla expression at about 
day 6 . 

In certain situations, cells were routinely collected 
35 after 12 days of culture, eventually adherent cells were 

recovered using a 5 rnM EDTA solution. Ir. other situations, 
the CDla+ cells were activated by re-suspension in complete 
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medium at. 5 x 10<5 ceils.'ml and activated for the 
appropriate time (e.g., 1 or 6 hi with - pg-nw pno^bol 1 

myristate 13-acetate (PM.A, Sigma) and 100 ng/rn^ lonomycm 
(Calbiochem, La Jolla, CA) . These cells were expanded for 
another 6 days, and BNA isolated for cDNA library- 
preparation . 

III. RNA isolation and library construction 

Total RNA is isolated using, e.g., the guanidine 
thiocyanate/CsC1 gradient procedure as described by 
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OLIGOTEX mRNA isolation kit. (QIAGEN) . Double stranded cDNA 
are generated using, e.g., the SUPERSCRIPT plasmid system 
(Gibco BRL, Gaithersburg, MD) for cDNA synthesis and 
plasmid cloning. The resulting double stranded cDNA is 
unidirectlonally cloned, e.g., into pSportl and transfected 
by electroporation into ELECTROMAX DH.10BTM cells (Gibco 
BRL, Gaithersburg, MD). 

Mouse or other species sources may also be used. 


IV. Sequencing 

DMA isolated from randomly picked clones, or after 
subtractive hybridization using unactivated cells, were 
subjected to nucleotide sequence analysis using standard 
techniques. A Taq DiDeoxy Terminator cycle sequencing kit 
(Applied Biosystems, Fester City, CA) can be used. The 
labeled DMA fragments are separatee, using a DN.e sequencing 
gel of an appropriate automated sequencer. .Alternatively, 
Che isolated clone is sequenced as described, e.g., in 
Maniatis, et al. (1982) Molecular Cloninn, A Laboratory 

Manua 1 . Cold Spring Harbor Laboratory, Cold Spimg Harbor 
Press; Sambrook, et al. (1989) Molecuiar—C loning ;—A 
Laboratory Manual , (2d ed.), vols. 1-3, CSH Press, NY; 

Ausubel, et al . , Biolouv . oreene P'uolishing .—.s^ociates , 
Brooklyn, NY; or Ausubel, et. al. (1987 and Supplements) 
Current Protocols In Mo lev: l ar P i o 1 ocry , Greene /V.’i ley , New 
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York. Chemical sequencing methods are al 


lilabie, 


e.g., using 


Maxam and Gilbert, sequencing techniques. 


V. Isolation of human DC protein genes 

The A05F12, the A07CC3, and E02B02 clones were 
sequenced, and analysed ter open reading frames. ..ic 
clones were further analysed to extend the nucleic acid 
sequence to a full, or nearly full, open reading frame. 

mRNA is prepared from appropriate cell populations by 
the FastTrack kit (Invitrogen) from which cDMA is generated 
using, e.g., Superscript Plasmid System for cDMA synthesis 
from GIBCO-BRL (Gathersburg, MD) essentially as described 
by the manufacturer. Modification to the procedure may 
include the substitution of other cloning adapters tor the 
Sail adapters provided with the kit . Tne resu-tant cDN.^ 
from these cells is used to generate libraries, e.g., m 
the plasmid PCDMA II (Invitrogen). The cDNA is cloned into 
the polylinker and is used to transtorm an appropriate 
strain, e.g., DH10B, of E. coii. Plasmid is isolated and 
purified, e.g., with the Qiagen system (Chatswcrth, CA) 
which is used to generate P.MA probes from, e.g., the SPo 
promoter. 

RNA probes are labeled, e.g., using tne uenius System 
(Boehringer-Mannheim) as descrioea oy the manufacturer. 
Filter lifts of the cDNA library can be pre-hybridized, 
e.g., at 42' C for 3-6 hours in Church's buffer (50%; 
formamide, 6X SSPE, 5C mM MahPov pc. . .2, > « CDS, j . 1 <■ 

N-Laury 1 sarccsir.e, 2% Roehrmger -Mannneim n- coxing 
reagent). Filters are probed, e.g., overnight m tne same 
buffer containing the appropriate probes. The niters are 
washed, e.g., as described by the Genius System. The 
colonies that hybridize are selected. 

The entire cDNA of human DC proteins are sequenced, 
e.g., by the dideoxynucleotide chain termination method 
with T7 polymerase (U.S. Biochemicals, Cleveland, OH) using 
double-stranded DMA as template. Data base searching and 
sequence analysis are performed using Intel1iGenetics 


tm, e.g., the SPe 
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programs {Mountain ^e.., " . 

exists between previously reported clones. 

Table 1 discloses sequence encoding a human di- 

ubiquitin protein, which contains two uciquiv.r. cmma-m 

which extend from about 1 (met) to about b3 '.pro; 

about 89 (pro) to about 165 (glyi• The putative 

polypeptide sequence comprises four cysteine residues which 

are not characteristic of a human ubiquitm domain. 

Related proteins are reported m, e.g., Nrasimhan, et al. 

( 1996 ) ,t . Biol . Chen. 271:324-330 ; Lowe, et ai . (19 95) 3L. 

Pathology 177:163-165; Loeb and Haas (1994) Mm . and cel .,1^ 


Biol . 


cp.em. 


Bn o1 . 14:8408-8419; Loeb and Haas 
267:7806-7813. The diubiquitin protein also exhibits 
similarity to the monoclonal non-specific suppressor factor 
beta (MNSFb) produced by mouse and human T cells. The 
MNSFb is a protein of about -33 residues with an N terminus 
similar to ubiquitm and a C-termmus similar to the S30 
ribosomal protein. The MNSFb protein is cleaved into two 
portions in the cytoplasm, the ubiquitm-like domain is 
secreted. The MNSFb is reported to inhibit the generation 
of LPS-induced immunoglobulin secreting ceils, the 
proliferation of mitogen-activated T and 3 celis, tne IL-4 
secretion by bone marrow derived mast -ae-ls, and the growth 
of various murine tumor ceil lines. See Madamura, et al. 

(1995) Broc. Mat 


A c a d . £ c l . U S e. 8 2 

"5 ') _ C. 'I 


63-3467; Nakamura, 
Nakamura, et. al . 

, et al. (1995) 




et al . (1996) ,7. Immunol, mb: 

(19 0 5 ) Bnr . 7 . I mm: :no 1 . 2 5:241.'- 

Irnrnun obi clone 192 : 26 c-2.1; /.av i e 
Tmmunol. 152 : 2624 - 2b3c; and others. 

Note that the ubiquitm conserved residues 48 (lys) 
and 70 (lys) are present here, which residues have been 
implicated in protein binding. The terminal glycine 
doublet is also characteristic of tne proteins, 
sequence was isolated from a 

library, and exhibits substantia- men-, my *. 

nmii ;,rU r\ bin SSCE11. The tirst 


This 

j CDla dendritic cell 
th EST 
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downstream from a Kozak consensus ocqjcntc „jagesuj.n a 
it may be the initiation codon. 

Ubiquitination may play a role in the generation 
the MHC class I binding epitope in antigen presentation. 

This suggests that the protein may play a role m the DC 
function of antigen presentation. The gene seems to be a 
single copy gene, and exhibits no r.omologs when a low 
stringency Southern hybridization is performed. PCR 
analysis indicates that the message is present at high 
levels in dendritic cells; at lower levels m JY (B cell 
line); at even lower levels in CHA (carcinoma cell line); 
and not detected in TF1 (hematopoietic cell line!, Jurkat 
(T cell line), MRC5 (lung fibroblast sarcoma ceil line), or 
U937 (pre-monocyte cell line). On Northern analysis, a 
single band of 0.9 kb was seen m dendritic s, but was 

less intense with JY ceils, and not detected in the other 
cell lines tested by PCR. _n a more physiological 
analysis, the levels were very low to undetectable in fresh 
T cells, granulocytes, E cells, and monocytes. Thus, the 
protein is a useful marker for quantitating or 
distinguishing dendritic ceils from these other cell types. 
The analysis may be destructive of the cells. Tissue 
distribution in fetal tissue message blots showed no 
detectable presence in brain, lung, liver, ana kidney, out 
no positive control was present. Similarly, m aouit 
tissues: heart, brain, placenta, lung, liver, skeletal 
muscle, kidney, pancreas, spleen, thymus, prostate, testis, 
ovary, small intestine, colon, or PEL, though there is no 

positive tissue control rn me sample. 

The A07CC3 sequence encoding a protein related to Ig 
family members was also isolated from an activated CDla 
dendritic cell library. At about positions 578 and 710, 
various isolates have various insert ions/deletions which 
suggest positions or intron splicing. putative 

hydrophobic stretch or signal sequence may run from about 1 
(met) to about 22 (val), and a porential transmembrane 


seciment runs 
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suggests that the 


otein is a membrane protein, ano may 
well be a receptor for another cel surface molecule in ar. 
interacting cell. Certain cysteine residues, e -S-> c.t 
positions 51 and 118 are characteristic of Ig domains. A 
region similar to the J chain of a type 1 variable chain 
runs from about 134 (gly) to about 141 ival). Two putative 
glycosylation sites are found in the part ammo proximal to 
transmembrane portion, with various putati.e 
phosphorylation sites in the carboxy proximal part. 

Sequence analysis suggests AQ7C03 is a member of the 
Ig superfamily of receptors, and is closely related to the 
CD8 family, which members contain a VlJ-type fold. ihe 
prediction of sice and start of protein is based, m part, 
upon those sequence comparisons. The analysis also 
includes secondary sequence structural analysis. A mouse 
counterpart is probably encoded in the EST W55567, isolated 
from brain. The sequence exhibits some homology with the 
R95734 sequence, and low similarity with a number of 
immunoglobulin V chain regions, particularly with the 
domain 2 of the polymeric immunoglobulin receptor (plgR)• 
The plgR is expressed in a wide variety or cel^ types, 
including epithelial and neuronal cells, and is 
transcytosed toward the cell membrane in a specific manner 
and binds to and internalizes polymeric immunoglobulins, 
particularly di-IgA. A soluble form;, secretory component, 
is implicated in the secretion of immunoglobulins. See, 
e.g., Cardone and Mostov (1995) FEBS Letm 376 :74-7o ; 


Nihei, et al 


! 1 Q Q S 


trch . Dp rma t o I on 


287 : 546-552; de 
.4 5 9; Ferkol, 




Mazanec 


et 


Hoop, et al. (1995) J, Biol. Che-. 1 
et al. (1995) J. Clin. Invest . 95:493-502; 

a 1 . (19 9 5! U. Virol. 69:1339-1343. 

The CD8 family members are typically dimers, and the T 
cells receptors are quasi-homodimers. These receptors 
typically bind to heterophi11 i c ligands, and the A07C03 is 
likely to bind to a molecule on an interacting cell type. 


e.g., 


or B cell, or other 


tell 


/pe found in the 
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st sarcoma ceil 1 

ine) , JY ■' 

ocyte cel 

1 line) 

In adult 

1 and 2 . 

5 kb are 

seen in s 

, brain, 

placent a 

, lung, 1 

pancreas, 

thymus, 

prostate, 


germinal centers where the dendritic cells pertarm critical 
roles in the initiation and control of immune responses. 

RT-PCR provides a strong signal only m dendritic 
cells. Northern blot analysis gives a single band at about 
1 kb in activated or resting DC and monocytes, but no 
detectable signal is seen m activated T cells, 
granulocytes, resting or activated PBb, or B ceils. No 
detectable signal is seen in TFl (hematopoietic cell line), 
Jurkat (T cell line), CHA (carcinoma cell line), MRC5 (lung 

i cell line), or U937 
tissues, two messages 
:leen and PBMC, but not 
rer, skeletal muscle, 
ymus, prostate, testis, ovary, small 

intestine, or colon. 

A sequence encoding a protein related to LAMP-like 
family members, designated E02B02, was isolated from human 
CDla dendritic cells. The initiation methionine is not 
found in this clone, but sequence analysis suggests that it 
is not far upstream of the sequence provided. The encoded 
protein exhibits homology to Lysosorne-Associated Membrane 
Protein (LAMP) family, see human lysosomal LMPl (P11279) 
and LMP2 (P491305 and CD 68 ;?3481Q;. Notable reatures are 
a hydrophobic length from about -23 (met) to about -1 
(ser), putatively a signal sequence; a likely transmembrane 
segment from about ile359 to ieu383; and a serine/proline 
rich stretch suggestive of a hinge from about prol84 to 
ser 19 9 . Residues arcj3 84 to ile392 are a cytoplasmic tail. 

Northern blot analysis gives a single bano at about 
3.5 kb in resting or activated DC, but no detectable signal 
is seen in monocytes, activated T cells, granulocytes, 
resting or activated PEL, or B cells. A strong positive 
signal is seen in dendritic cells and a weak signal in 
Jurkat (T ceil line), but no detectable signal is seen in 
TFl (hematopoietic cell line), CHA (carcinoma ceil line), 
MRC5 (lung fibroblast sarcoma cell 1 me) , JY >'B ceil line), 
or uq 37 [pr-p -inonoc vt e ceil line). In adult tissues, a 
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message of about 3.5 kb is seen ir. _ iver, out not in heart, 
brain, placenta, lung, skeletal muscle, kidney, pancreas, 
spleen, thymus, prostate, testis, ovary, small intestine, 
colon, or PEL. Tissue distribution in fetal tissue message 
blots showed no detectable presence in brain, lung, liver, 
and kidney, but no positive control was present. 

The E02B02 message is weakly expressed in human cord 
blood progenitors cultured in the presence of GM-CSF and 
TNFot into dendritic cells, at the 6 day stage. In 
contrast, at days 12-16, when precursors mature into 
dendritic cells with typical DC morphology and phenotype, 
large amounts of message are detected. PCR analysis 
detected expression also in langerhan.s cells, but not in a 
population of basal cells containing mostly keratinocytes. 
PMA-ionomycin activated macrophages generated in vitro from 
CD34+ progenitors cultured with M-CSF express the message. 
E02B02 expression is upregulated after CD40L activation in 
monocyte-derived dendritic cells, as well as m 
CD4+CD11c+CD3- dendritic cells isolated ex vivo from 
tonsillar germinal centers. 

FACS analysis reveals that DC-LAMP expression 
increases during DC maturation. It thus represents a 
useful marker for mature dendritic cells. On tonsil 
sections, DC-LAMP stains specifically mterdigitating DC in 
the T cell area, but not in the germinal center. Thus, the 
CD-LAMP is a useful specific marker for mterdigitating DC. 

Confocal microscopy suggests that this DC-LAMP may 
specifically stain a novel lysosomal compartment. In 
comparison with the LAMP-! and LAMP-2 proteins and Class II 
on CD34+ derived DC, the DC-LAMP stains different 
lysosomes, possibly the same ones involved m the transport 
of MHC-Class II complexes. 

The lysosomal protein homology suggests that this 
protein may be involved in that compartment ir: the DC, and 
possibly related to degradation of protein in antigen 
presentation functions. 
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VI. Recombinant- DC gene construct 

Poly(A) + RNA is isolated from appropriate cell 
populations, e.g., using the FastTrack mRMA kit 
(Invitrogen, San Diego, CA). Samples are electrophoresed, 
e.g., in a 1% agarose gel containing formaldehyde and 
transferred to a GeneScreen membrane (13EM Research 
Products, Boston, MA) . Hybridisation is performed, e.g., 
at 65' C in 0.5 M NaHP0 4 pH 7.2, 7% SDS, 1 mK EDTA, and 1% 

BSA (fraction V) with 22 P-dCTP labeled DC gene cDNA at 10 7 
cpm/ml. After hybridization filters are washed three times 
at 50' C in 0.2X SSC, 0.1% SDS, and exposed to film for 24 

h. 

The recombinant gene construct may be used to generate 
probe for detecting the message. The insert may be excised 
and used in the detection methods described above. 


VII. Expression of DC gene Protein in E. coli. 

PCR is used to make a construct comprising the open 
reading frame, preferably m operable association with 
proper promoter, selection, and regulatory sequences. The 
resulting expression plasmid is transformed into an 
appropriate, e.g., the Topp5, E. coil strain (Stratagene, 
La Jolla, CA) . Ampiciilin resistant !50 (.ig/mlj 
transformants are grown in Luria Broth (Gibco) at 37 * C 
until the optical density at 550 nm is 0.7. Recombinant 
protein is induced with 0.4 irM 

isopropyl-pD-thiogaluctopyrar.cside ;Sigma, St. Louis, MO) 


and incubation of the cells cent 
further 18 hours. Cells from a 
harvested by centrifugation and 
ml of ice cold 30%. sucrose, 5 0 m 
ethylenediammet et raacet ic acid . 
cold water is added to a total v 
20 min. on ice, cells are remove 
supernatant is clarified by tilt 


ir.f.sd at 2 C * C for a 
1 liter culture are 
resuspended, e.g., in 200 

M Tris HC1 pH 8.0, 1 mM 

After 10 mm. on ice, ice 
oiume of 2 liters. After 
d by centrifugation and the 
rat ion via a 5 (.iM Miilipak 


60 (Miilipore Corp., Bedford, MA 
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The recombinant 

prot 

ein is puri 

fied via standard 

purification methods 

, e . g 

., various 

ion exchange 

chromatography metho 

ds 

Immunoaffin 

ity me t hod s using 

antibodies described 

bel o’ 

w can also 

be used. Affinity 

methods may be used ’ 

where 

an epitope 

tag is engineered 


an expression construct. 


VIII. Mapping of human DC genes 

DMA isolation, restriction enzyme digestion, agarose 
gel electrophoresis. Southern blot transfer and 
hybridization are performed according to standard 
techniques. See Jenkins, et ai. .itbi; J. Virol. 43:26-36. 
Blots may be prepared with Hyfoond-M nylon membrane 
(Amersham). The probe is labeled with J “P-aCTP; washing is 
done to a final stringency, e.g., of 0.IX SSC, 0.1% SDS, 

65 ' C . 

Alternatively, a BIOS Laboratories (New Haven, CT) 
mouse somatic cell hybrid panel may be combined with PCR 
methods. The diubiquitin gene matches the EST 'J37231, 
which maps to the MHC Class I region, e.g., human 
chromosome 6. See Fan, et ai. (1956) Immunoaenetics 44:97- 

103 . 

The E02B02 gene has been mapped to human chromosome 3, 
bands 3q26.3-q27. This is a different chromosomal 
localization than the LAMP-1 and LAMP-2 genes. 

IX. Analysis of individual variation 

From the dist r ifc•_:t ior. da t a , ar. atbur.dant eas 1 1 y 
accessible cell type is selected for sampling from, 
individuals. Using PCR techniques, a large population of 
individuals are analyzed for this gene. cDNA or other PCR 
methods are used to sequence the corresponding gene in the 
different individuals, and their sequences are compared. 
This indicates both the extent of divergence among racial 
or other populations, as well as determining -which residues 
are 1 ikely to be modifiable withour dramatic effects on 
function. 




5 


X. Preparation of Antibodies 

Recombinant DC proteins are generated by expression m 
E. coli as shown above, and tested for biological activity. 
Active or denatured proteins may be used for immunization 
of appropriate mammals for either polyclonal serum 
production, or for monoclonal antibody production. 


XI. Isolation of counterpart primate or rodent DC genes 
Human cDNA clones encoding these genes are used as 
probes, or to design PCR primers to find counterparts in 
various primate species, e.g., chimpanzees. u .kev,ise, 
mouse sequences mav be used to isolate counterpart 


?s nay be usee. 


sequences from other rodent species. 


XII. Use of reagents to analyze cell populations 

Detection of the level of dendritic cells present in a 
sample is important for diagnosis ot aberrant disease 
conditions. For example, an increase in the number of 
dendritic cells in a tissue or the lymph system can be 
indicative of the presence of a DC hyperplasia, or tissue 
or graft rejection. A low DC population can indicate an 
abnormal reaction to, e.g., a bacterial or viral infection, 
which may require the appropriate treat to normalize the DC 


response. 

FACS analysis using a labeled binding agent specific 
for a cell surface DC protein, see, e.g., Melamed, et. al. 
(1990) Flow Cvt ome- rv and Sort inn V.'i ley - Liss , Ir:c., New 
York, NY; Shapiro (19HBi Practical Flow Cytomerry Liss, New 
York, NY; and Robinson, et al. (1992) Handbook of Fl ow 

(~vt-_omet~rv Methods VJiley-Liss, New York, NY, is used in 
determining the number of DCs present m a cell mixture, 
e.g., PBMCs, adherent cells, etc. The binding agent is 
also used for histological analysis or tissue samples, 
either fresh or fixed, to analyze infiltration of DC. 
Diverse cell populations may also oe evaluated, either ^n a 


35 


BATES, st. ci_ . 




cell destructive assay, o: 


rertain assays v.'iiere ceils 


retain viability. 

Analysis of the presence or soluble intracellular 
molecules is performed, e.g., with a fluorescent binding 

agent specific ror a ao desc^-*-■_. -n - - 

(1995) I p vi K=-d 182 :1357-1367 . alternatively, tissue or 

cell fixation methods may be used. 

Levels of DC transcripts are quantitated, e.g., using 
semiquantitative PCR as described m Murphy, et al. (1993, 

■T t mum no 3 Methods 162:211-223. Primers are designed such 

that genomic DMA is not detected. 


XIII. Isolation of a binding counterpart 

\ jjo protein can be usee as a specific binding 
reagent, by taking advantage of its specificity of binding, 
much like an antibody would be used. A binding reagent is 
either labeled as described above, e.g., fluorescence or 
otherwise, or immobilized to a substrate for panning 
methods. 

The DC protein is used tc screen for a cell line which 
exhibits binding. Standard staining techniques are used to 
detect or sort intracellular or surface expressed ligand, 
or surface expressing transformed cells are screened by 
panning. Screening of intracellular expression is 
performed by various staining or immunofluorescence 
procedures. See also McMahan, ec al. (1991) EMBO 6U 

10:2821-2832. 


For example, on day 0, precoat 2-chamber permanox 
slides with 1 ml per chamber of fibronectin, 10 ng/ml in 
PBS, for 30 min. at room temperature. Rinse once with PBS. 
Then plate COS ceils at 2-3 x 10 5 cells per chamber m 1.5 
ml of growth media. Incubate overnight at 3/ C. 

On day 1 for each sample, prepare 0.5 ml of a solution 
of 66 mg/ml DEAE-dext ran , 66 mM chlcroquine, and 4 mg DMA 

^ ^ ra.-ir rnr CAft . ci CCSlt 1VG CCHtrol is 


in serum free DM 
prepared, e.g., 


For each set, 

V-* -i i T-r-, C, p V- 0 0 0 t- Q y 


G cDMA at 1 and 1/200 
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dilution, and a negative mock. 
DME. Add the DMA solution and i 
Remove the medium and add 0.5 ml 


-;se cells with serum free 
:bate 5 hr a: 37' C. 


DMSO in DM! 


:or 


rov.t 


min. Remove and wash once with DME. Add 1.5 ml 
medium and incubate overnight. 

On day 2, change the medium. On days 3 or 4, the 
cells are fixed and stained. Rinse the cells twice with 
Hank's Buffered Saline Solution (HBSS) and fix in 4% 
paraformaldehyde (PF.A)/glucose for 5 mm. Wash 3X with 
HBSS. The slides may be stored at -80“ C after all liquid 
is removed. For each chamber, 0.5 ml incubations are 
performed as follows . -Add HBSS/saponin ( C . i * ) with i2 ml, ml 


min . 


. ’ i 


are 


: hen washed wit h 


Add protein or protein/antibody complex 


of 1M NaN 3 for 
HBSS/saponin IX 
to cells and incubate for 30 min. Wash cells twice with 
HBSS/saponin. If appropriate, add first antibody for 30 
min. Add second antibody, e.g.. Vector anti-mouse 
antibody, at 1/200 dilution, and incubate for 30 min. 
Prepare ELISA solution, e.g., Vector Elite ABC horseradish 
peroxidase solution, and preincubate for 30 mm. Use, 
e.g., 1 drop of solution A (avidirm ana 1 drop solution B 

(biotin) per 2.5 mi HBSS/saponin. Wash ceils twice with 
HBSS/saponin. Add ABC HRP solution and incubate for 30 
min. Wash cells twice with HBSS, second wash for 2 min., 
which closes cells. Then add Veer or diaminobenzoic acid 
(DAB) for 5 to 10 min. Use 2 drops of buffer plus 4 drops 

HoCh per 5 mil of glass distilled water. 

e oiide in water. Air dry 

for a few minutes, then add 1 drop of Crystal Mount and a 
cover slip. Bake for 5 mm. at 85-90 * C. 

Alternatively, other DC protein specific binding 
reagents are used to affinity purify or sort out cells 
expressing a receptor. See, e.g., Sambrook, et al. or 
Ausubel, et al. 

Another strategy is to screen for a membrane bound 
receptor by panning. The receptor cDM.A is constructed as 
described above . The ligand can fc • mmobi 1 1 c< 


DAB plus 2 drops 


Carefully remove chamber ar.d rmse '• 1 


•d and used to 
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immobilize expressing cells. Immobilization may oe 
achieved by use of appropriate antibodies which recognize 


e.g., a FLAG sequence o 


^ DC 


protein fusion construct, or 


by use of antibodies raised against the first antibodies. 
Recursive cycles ot selection and citp:i^icat*^.i lead to 


eventual 


Lac ion oi 


enrichment of appropriate clones an 
ligand expressing clones. 

Phage expression libraries can be screened by DC 
protein. Appropriate label techniques, e.g., anti-FLAG 
antibodies, will allow specific labeling of appropriate 
clones. 

All references cited herein are incorporated herein by 
reference to the same extent as earn inaiviaudi 
publication or patent application v.vts spe^x^-^a_l\ and 
individually indicated to be incorporated by reference in 

its entirety for all purposes. 

Many modifications and variations of this invention 
can be made without departing from its spirit and scope, as 
will be apparent to those skilled in the art. The specific 
embodiments described herein are offered by way ot example 
only, and the invention is to be limited only by the terms 
of the appended claims, along with tne f.:d bC °P fe of 
equivalents to which such claims are 


entitled. 
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RELATED REAGENTS 

(iii) NUMBER OF SEQUENCES: 12 

(ivj CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: DMAX Research Institute 
IB) STREET: 501 California Avenue 

< C ) C I TV : Pa 1 r> Alto 

( D ) STATE : C'a lit cm l a 
!E) COUNTRY: USA 

< FJ DIF': 54 304-11 (• 4 

(v) COMPUTER READABLE FORM: 

(A ) MEDIUM TYPE: Floppy disk 
{B ) COMPUTER: I EM PC compatible 
(C J OPERATING SYSTEM: PC-DOS 'MS - DOS 
(D; SOFTWARE: Patent In Release «i .C , 


(Vi) CURRENT APPLICATION DATA: 

iA) APPLICATION NUMBER: US 
FILING DATE: 2 5-NOV-199" 
i C) C LASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

•: p ■ APPLICATION NUMBER: US ; 
(E) FILING DATE: 11-DEC-1P9: 


. S 1 O!. “ 1 . i 1 




BATES, et al . 



>9K 
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10 


(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60-031.806 
(E) FILING DATE: 27-NOV-1595 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Ching, Edwin P. 

(E) REGISTRATION NUMBER: 34,090 

(C) REFERENCE/DOCKET NUMBER: DX0669K 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (650)852-9196 

(B) TELEFAX: (650)496-1204 


15 


20 


25 


30 


35 


40 


45 


50 


(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDMA 


(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 19.. 513 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 


GGCCCCTTGT CTGCAGAG 

ATG 

GCT 

CCC 

AAT 

GCT 

TCC 

TGC 

CTC 

TGT 

GTG 

CAT 

51 






Met 

Ala 

Pro 

A s ri 

Ala 

o e r 

Cys 

Leu 

Cys 

Val 

His 







1 




5 





10 



GTC 

CGT 

TCC 

GAG 

GAA 

TGG 

GAT 

TTA 

ATG 

ACC 

TTT 

'oA i 

GCC 

AAC 

CCA 

TAT 

99 

Val 

Arg 

Ser 

Glu 

Glu 

Trp 

Asp 

Leu 

Met 

Thr 

Phe 

Asp 

Ala 

Asr, 

Pro 

Tyr 





15 










2 5 




GAC 

AGC 

GTG 

AAA 

AAA 

ATC 

A.-iA 

GAA 

CAT 

GTC 

CGG 

TCT 

AAG 

ACC 

AAG 

GTT 

147 

Asp 

Ser 

Val 

Lys 

Lys 

lie 

T - • 

Glu 

His 

Va 1 

Ai' c; 

.?> e r 

Lys 

Thr 

Ly s 

Va 1 




3 0 





C 





4 0 





CCT 

GTG 

CAG 

GAC 

CAG 

GTT 

CTT 

TTG 

CTG 

GGC 

TCC 

-* •> 

rtrto 

ATC 

TTA 

AAG 

CCA 

195 

Pro 

Val 

Gin 

Asp 

Gin 

Va 1 

Leu 

Leu 

Leu 

Gly 

Ser 

Lys 

lie 

Leu 

Ly s 

Pro 



45 





50 





55 






CGG 

AGA 

AGC 

CTC 

TCA 

TCT 

TAT 

GGC 

ATT 

GAC 

AAA 

GAG 

AAG 

ACC 

ATC 

CAC 

243 

Arg 

Arg 

Ser 

Leu 

Ser 

Ser 

T-yr 

Gly 

He 

Asp 

Lys 

Glu 

Lys 

Thr 

lie 

His 


60 





55 










75 


CTT 

ACC 

CTG 

AAA 

GTG 

GTG 

A.AG 

CCC 

AGT 

GAT 

GAG 

gag 

CTG 

C C C 

TTG 

TTT 

291 

Leu 

Thr 

Leu 

Lys 

7a 1 

Va l 

Lys 

Pro 

S e r 

Asp 

G1 u 

Glu 

Leu 

Pro 

Leu 

Phe 



55 


6 b 



BATES 



CTT 

GTG 

GAG 

TCA 

GGT 

GAT 

OrV-j 

V_J i 


AGG 

v. 

CTC 

CTC 

CAG GTG 

U on 


Leu 

Va 1 

Glu 

ii’ e r 

Gly 


Cj 1. U 

r\ 1 A 

Lys 

Arg 

H i s 

Leu 

Leu 

Glr. Val 

Arg 





9 5 





1 0 C 





10 5 


5 

AGG 

TCC 


TCA 

GTG 

GCn 


GTG 



r.mg 


GAG 

ACT AAG 

05 



V. » » 






Arg 

Ser 

Ser 

Ser 

Va 1 

Ala 

Glr. 

va 1 

Ly s 

A1 A 

Met 

lie 

Glu 

Thr Lys 

Thr 




110 





115 





1 ° 0 



10 

GGT 

ATA 

ATC 

COT 

GAG 

ACC 

SAG 

ATT 

GTo 

A v_ * 

TGC 

ri.n i 

Gvj/i 

A. AG AG A 

■~TG 


Gly 

lie 

lie 

Pro 

Glu 

Thr 

Gin 

lie 

. A 1 

Thr 

Cys 

Asr. 

Gly 

Lys Arg 

Leu 



125 





130 





i t c; 

-i- -j 





GAA 

GAT 

GGG 

AAG 

rnc 
/-i l 

ATG 

GCA 

GAT 

TAC 

GGC 

ATC 

rtLt A 

AAG 

GGC AAC 

TTA 

15 

Glu 

Asp 

Gly 

Lys 

Met 

Met 

AI A 

Asp 

T.' V 

Gly 

He 

Arc; 

Lys 

G i. y s 11 

Leu 


140 





145 





1 5 0 




155 


CTC 

TTC 

ctg 

G C hh 

tct 

m ’ 'T' 

TGT 

ATS' 

gga 

GGG 

TGAC 

C ACC 

' C T < 

GGGGATGG: 

GG 


Leu 

Phe 

Leu 

Ala 

.Per 


Cy s 

He 

G ly 

' - ■> 1 \' 







20 150 


TGTTGGCAGG GG TC A AAA AG CTTATTTCTT TTAATCTCTT A 




C ACATCTTCT 


25 


GATGATTTCC CAAAATTAAT GAGAATGAGA TGAGTAGAGT AAGATTTGGG TGGGATGGGT 
AGGATGAAGT ATATTGCCCA ACTCTATGTT TCTTTGATTC TAACACAATT AATTAAGTGA 
CATGATTTTT ACTAATGTAT TACTGAGACT AGTAAATAAA TTTTTAAGGC AAAATAGAGC 


3 0 ATTC 


(2) INFORMATION FOR 3EQ ID NO : 2 : 


35 


(i) SEQUENCE CHARACTERISTIC-? : 

(A) LENGTH: 165 amine acids 
! B ) T V P E : a n, i r* r. a c i d 
[ D| TOPOLOGV: 1 ln ear 


40 


45 


( i i ; MOLECULE TYPE: protein 
( x i ) .SEQUENCE OE?CR I FT I Cl.’: .SEQ 


Met Ala Pro Asn 
1 


r.ia r 


Val His Vii 1 Arg Ger Glu Glu 
10 15 


50 


Trp Asp Leu Met Thr Phe Asp Ala Asn Pi 
2 0 2 5 


'r Asp i,er va. 


Ly s 


He Lvs Glu His Val Arg Per Lys Thr Lys Val Pro Val Gin Asp Gin 


Val Leu Leu Leu 
50 


Lys lie Leu Lys Pro Arg Arg .O'er Leu 

5 1 ri 0 


387 

435 

483 

533 

593 

653 

713 

773 

777 


55 





- 






£ 

) 



BATES, 

ai™ 






r 

^ ■ v 


Ser 

Tyr 

G1’,' lie Asp La’s cj . u Ly s 

Thr 

lie 

H i s 

Leu 

rpv_ v . 

Leu 

Lys Val 


6 5 


~ 0 



r 




O J 


Va 1 

Lys 

Pro Ser Asp Glu Glu Leu 

Pro 

Leu 

Phe 

Leu 

Va 1 

Glu 

Ser Gly 

5 


35- 


9 C- 





Z cc 


Asp 

Glu 

Ala Lys Arg His Leu Leu 

Gin 

. a 3 . 

Ar g 

Ar g 

a e r 

Ger 

Ser Val 




100 

1 C 5 





11 1 


10 

Ala 

Gin 

Val Lys Ala Met lie Glu 

Thr 

i_iV s 

Thr 

Giv 

T ’ 

tr 

T 1 

lie 

Pro Glu 




115 120 





1 "D C 




Thr 

Gin 

He Val Thr Cys Asn Gly 

Lv s 

Arg 

Leu 

Glu 

Asp 

Gly 

Lys Met 



130 

135 




1 4 0 




15 

Met 

Ala 

Asp Tyr Gly lie Arg Lys 

Gly 

Asn 

L^u 

Leu 

Phe 

Leu 

Ala Ser 


145 


1 5 0 



1 R R 




1 6 0 


Tyr 

Cys 

He Gly Gly 








20 



1 5 5 









(2) 

INFORMATION FOR SEQ ID NO : 3 : 










( i ) 

SEQUENCE OHARACTERISTIC 

■s : 







25 



(A) LENGTH: 4 9 5 base 

•a i r. 










( E > TV ? E : nuclei c a c i c! 











(C) STRANDEDNESS : sing 

lie 










<D) TOPOLOGY: linear 








30 


( ii ) 

MOLECULE TYPE: cDMA 










( ix) 

FEATURE : 











(A) NAME'KEY: CDS 








35 



(El LOCATION: 3..493 










(XI) 

SEQUENCE DESCRIPTION: .7 

-Eg 

ii:- n: 






40 

TACAGAC 

ATG GOT TCT GTC CGC ACC 

TGT 

C' r P r T 1 

GTC 

C' 'T' 

TCA 

GAC 

CAA TGG 




Met Ala Dei Val Arc: Thr 

C'c s 

Va i 

.71 1 

Arg 

Ser 

Asp 

Gin Trp 




1 5 




10 





CGG 

TTA 


GAG 

AAT 

GAC 

AAA 

GTG 

.■ 1 .* *. 

’ -* ■- »rp» 
nrt «. ■* /-v i 

45 

Arg 

Leu 

Met Thr Phe Glu Thr Thi 

3 1; 

As:. 

As;: 


Va 1 

Ly c 

Ly s lie 


15 


2 0 



2t 




3 Q 


AAT 

GAA 

CAT ATT ovGG TV 1 . .—a - 

A AG 

GTC 

TCT 

GTA 

CAG 

GAC 

CAG ATC 


Asn 

Glu 

His lie Arg Ger Gin Thr 

Ly s 

Va 1 

G ►? r 

Vox 1 

G i n 

Asp 

Gin lie 

50 



o c; 


4 'j 





45 


CTT 

Leu 

CTG 

Leu 

CTA GAC TCC AAA ATC CTC 

A ’ r ' 

r , r , r . 

r . r rr^ 



rp'TV'^ 

TCA TCC 


Leu Asp ,:'er Lys lie Leu 

Ly r s 

Prc 

K r s 

A i g 

Lys 

L e 11 





C, ,'i 

c c 





y r, 



55 



BATES, e: 


10 


15 


20 


25 


TAT 

Tyr 

Gly 

^ rT“T> 

He 

f' C . 

GAC 

Asp 

Lys 

'on.-. 

Glu 

Thr 

Thr 

lie 

r- ' v ['Z' r T 

Thr Leu Lys 

- c 

Val 

Va 1 

24 1 

AAG 

Lys 

CCC 

Pro 

3 0 

ACT 

Ser 

GAT 

Asp 

G A A 

Glu 

GAG 

Glu 

Leu 

3 5 

CCC 

Pro 

Leu 

rj-tmrn n-'|—’ 

Phe Leu 

.—■> Z^G T r C 

Val Glu Ser 

Ly s 

As n 

289 

GAG 

Glu 

95 

GGG 

Gly 

Gin 

rt V.» Id 

Arg 

His 

CTC 

LelJ 

10 0 

Leu 

Arg 


Arg Arg 

10 S 

TCC AGC TC A 

Va 1 

’o v 0. 

Ala 

110 

337 

CAG 

Gin 

GTG 

Val 

AAA 

Lys 

G AG 

Glu 

ATG 

Met 

115 

ATC 

lie 

G.*-\^3 

Glu 

AGT 

Ser 

GTG 

Va 1 

-TN. 

^Lr Per 

1 2 0 

GTG ATC CCT 
. ,;i 1 lie Pr o 

AAG 

Lys 

125 

A.rtG 

Lys 

335 

CAG 

Gin 

GTT 

Va 1 

GTG 

\ ci I 

■*. T. l-T-i 
J. 

rv5 D 

1 ~i n 

TGC 

Asn 

G1 

Lys 

Lys 

CTG GAA 

Lev; Glu 

gat GGA AAG 
Asp Gly Lys 
14 0 

ATC 

mu 

Met 

433 

GCT 

Ala 

GAC 

Asp 

TAG 

Tyr 

145 

A AC 

Asn 

ATC 

lie 

A.hg 

Lys 

A-.-* I 

’ C. i*' 

AGT 

TTG CTC 

Lev; Levi 

TTT CTG ACA 
Phe Leu Thr 
155 

AC A 

Thr 

C Ah 

His 

481 

TGC 

Cys 

ACT 

Thr 

160 

GGG 

Gly 

GGA 

Gly 

Hpr; ^ 









496 


30 


35 


40 


45 


50 


55 


(2) INFORMATION FOR 3E0 IF NO:4: 

( i } SEQUENCE CHARACTERISTIC!? : 

I A'- LENGTH: 1 62 amine. ari'-:n~ 

in TOPOLOGY : lir.4-.Hl 

(Hi MOLECULE TYPE: pt-.eU. 

(x 1 

Met Ala Ser Val Ary 
1 


Asp Gin T1i :> Arg Leu 

15 


Asn Giu 


Met Thr Phe Glu Thr Thr 3 It: Aon n sp ~/s ■ •* ' , 

^ r. * 

, .Glr. I1-- Leu Leu 

Set Gin Thr Lys . i .--=•! - * . ’ 


His lie Arc: 


Leu Asp Gei t/s 
5 0 


lie Asp Lys Git: Tr.r -hr 
65 " : 


! P P , ... 


i_*rU i 1 1 1 


Tyr Gly 


30 






BATES 


Ser Asp Glu Giu 


Leu Pro Leu Phe Leu ■ ai -'-a —e — " ‘ 


_ r . . » ? .. 1 ■*' "!=» ^ "’.'A i 

Gin Arg His Leu Leu Arg • a 1 Arg «ig • C ’ 1 ''* * 

10 0 3 - 5 x * ' 

T1 Th>- Ser Veil He Pro Lys Lys Gin Val 

Lys Glu Met Je Glu ee- -u* 1R* - J, =^ ioc 

115 12 0 * “ '' 

Val Asn Cys Asn Gly Lys Lys Leu Glu Asp -w *-yt> He -H. rtla " t 
13 0 13 5 1 ’ J 


Tyr Asn 
145 


lie Lys Ser Gly 
15 0 


_,eu Leu Phe Leu Thr Thr His Cys .nr 
■cq 160 


Gly Gly 

(2) INFORMATION FOR SEP IE 1:0 : 5 : 



(i) 

SEQUENCE CHARAC 

TER I! 



(A) LENGTH: 10 

4 0 b< 



(B) TYPE: raid 

6 1C * 



(C) STRANDEDNE 

.23: 



(D) TOPOLOGY: 

i ine. 


( i i ) 

MOLECULE TYPE: 

CDNA 


( ix) 

FEATURE: 




(A) NAME/KEY: 

CD3 



<E) LOCATION: 

* c 

-± —• . • 


( ix) 

FEATURE: 




( A) NAME / KEY : 

IT: a t _ 



( E J LOCATICt: : 

111. 


(x i ) 

c; irr )rjrr jr; £ ~’EC '"r. 

- r • rp ■- 

rpTC C 

:TTTC 

AA ATAC A- 

AA’ - 

AGC 

Tv,'-’?* 

AGC AAC ATC GCT 

CGC 

Ser 

Arg 

Ser Asn lie Ala 

Arg 



-15 


TGT 

GTC 

GGT GCT GTG GGC 

GGC 

Cys 

V cl 1 

G1 y A1 ci Vri - G 1 v 
t_ 

A i. ii 

TAC 

CTA 

GAA GTG GAC TAC 

ACT 

Tyr 

Leu 

Glu Vai Asi Tyi 

Thr 

1? 


2 



r i . T r. • ;;; 


ATG GGG ACT GCG 
Met G1v Thr Ala 


ACC AAT CTC ATT CTA TTT 
Thr Asn Leu lie Leu Phe 


... rr-, ; -r CTC TCT GTG AC A CAA CCG TOG 

-.. . 'p. . Lev: Cer Ye 1 Thr Gin Pio Tip 

Cl X ‘. 1 

His Glu Ala Val Thr lie Lys Cys ihi 
-> 5 3 0 


BATES, 


3 . i. . 


10 


15 


20 


25 


30 


35 


40 


45 


50 


TTC TCC GCA ACC GGA TGC CCT HT Cnn ~ 

Phe Ser Ala Thr Gly Cys Pro Ser Glu Gin Pro Tnr -ys 

-j c; 1 - 


ctg t. 


Trp PH 


CGC TAC GGT GCT CAC CAG C-T U. — ~ T -’ ^ ° ~ " cl .'. ^.. s 

Arg Tyr Gly Ala His Gin Pro Glu Asn Leu .ys Leu •“■-■t- Gl, • -- 

SO 


Abl unc. - __ _ 

Ser Glu Ala Asp Lys Phe Thr Val Arg Glu Ala Leu Lys G.u Asn 5 

v TCA AAT GAC AGT GCA ATT TAC 


ATC TGT GGA ATA — •*- — ^ ^ Lvs Gin 

lie Cys Gly lie Ala Phe Pro ,er -al Pi-■ -la «W —' - 


9 S 


GAC 

AAG 

Asp 

Lys 

ACT 

GTA 

Thr 

Val 

ATA 

GCA 

lie 

A1 a 

GGG 

ACC 

Gly 

Thi¬ 

ll? 

CGG 

A'O w 

Arg 
1 3 0 

Ser 

GGT 

GTG 

Gly 

Val 


GTG AGG GAG GCC CTC AAA GAA AAu 
al 
7 0 


GTG 


CAG 


a PTC G^G GTA AGA GAA ATT AAG CTCi CTC mGC 
AC, A CjjLjtA UCm vj'j'.j v_ i ^ ^ 

Thr Gly Gly Gly Thr Thr Leu Val Val Arg Glu lie Lys Leu ,eu ,-er 


1 4 C 


14 ‘ 


L 5 0 


1 


TCC AAC CCT CTA AGA AAG AAA GAA ATA AAA -HA G-v Tc« 

Ser Asn Pro Leu Arg Lys Lvs Glu lie Lys Glu ? sp ,er Gin L,s L, s 

160 155 

^ tt ‘^"-n-n r ' i>q Z- ^ ^\ rr 'Z' ClC’T CAA GAA CTA TAC c, AT 

AAG AGT GCT CGG CoT «TT n ^ , T r^,,. u \ <= 

, T n T-.I. . , T1-, 111 . ri 1 r (J| \ u Levi I /l HIS 

Lys Ser Ala Arg nig - °~ fc **- 7 or _ " ‘ “ 190 

17 5 1 ^ “ ' 

AAG AGA c-AT gT'o .-A.-. ^ ‘ , ■» o»- r r>v- 1 - 

Lys Arg His Val Glu Thr a,,u air, l.,t. -e. —a ^ 


» ^m, r-TT TCC AA r TAT GAA AGG CCA TAGAAACGTT 

TAT GAA AAC AGA AGA GT— -T1 ILL —- r- 1 

Tyr Glu Asn Arg Arcj Val Leu Sei .~.sr. Tyr -’•* 11 rtlc J ■ L ~ J 

210 215 

„ > .-rr^- r - rs’TilGC AGT GTTAATGAAC 

TTAATTTTCA ATGAAsT^a.. TV>— J - - 

AT AT ATC ATC AGGTCTT AAA AAAAAAATAA AGGTAAA'Tc, — 

GAAGGATGTC AGAATGTAAG GAAACTATAA CTAAT«GTv n T.-^.——1 i n-n-v. 


296 

344 

392 

440 

488 

536 

584 

632 

680 

728 

777 

337 

897 

957 


55 


BATES, et ■:> - • 


AACAAAATGC AACTGAAAAA TA 


(2) INFORMATION FOR SEC IF' NO : o : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 041 amino acids 

(B) TYPE: amino acid 
(D) TO PO LOGY: lineai 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEC ID '.<0:6: 


Met Gly Thr Ala Set Arci Sei Asn 
-22 -20 - 15 


Leu lie Leu Phe Cys 




; lr- Ala Arci His Leu Gin Thr Asn 


••.',1 Glv Ala Cvs Thr Leu Ser Val 
c 10 


Thr Gin Pro Trp Tyr Leu Glu Va, 
15 


kS p Tvr Thr His Glu Ala Val Thr 
a "> 5 


lie Lys Cys Thr Phe Ser Ala Thr 
20 


■ 31 V cvs Pro Ser Glu Gin Pro Thr 
it 4 0 


Cys Leu Trp Phe Ary Tyr Gly Ala His Glr. Pro Glu Asn 
4 5 ~ L ' 


e?u Cys Leu 


Asp Gly Cys Lys Se 


■ - phe Thr Val Ary Glu Ala Leu 


Lys Glu Asp. Gin Val Ser Leu Thr Val Asn Ary Val Thr Ser Asn Asp 
7 5 8 0 3 " 


Ser Ala lie Tyr lie Cys Gly lie A 


e Ala Phe Pro Ser Val Pro Glu Ala 
, r r 1 0 5 


Arg Ala Lys Gin Tr.r ’-.c/ 

1 1 0 

Lys Leu Leu Ser Lys --.ru L>ru 
125 


.i-- r pj- l ; chi L-'-u Val V.a 1 Ary Glu lie 
Arc: Ser Phe Leu Thr Ala Leu Val Ser 


Leu Leu Ser Val Tyr Val Thr G 

14 0 i 4 5 

Ser Lys Ser Lys Ser Asn Pro L 
155 15 ; -' 


Glv Val Cys Val Ala Phe lie Leu Leu 


^eu Arg Lys LVs Glu lie Lys Glu Asp 
- ; r. 170 


T ... r .. . t - > i— r 1 ■■ a’"' iv lie E he Git: Glu lie Ala Gin 

3er Gin Lys Lys L/s .-e! a- ... ...... . - - ^ 


55 


5 


BATES, e: 


Glu Leu Tyr His Lys Arc: His Val Glu Tr.r 
190 155 




Asp Asn Asn 
205 


Thr Tyr Glu Asn .Arci -Ar g 


Glu r.rq 


(2) INFORMATION FOR SEQ ID NO:": 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1042 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cINA 


l IX.) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 45..76" 

(ix) FEATURE: 

(A) NAME/KEY: mat _pept ld- 
(B! LOCATION: 111..7 50 


(ix) FEATURE: 

(A; NAME ■ KEY : misc_£eature 
(B) LOC ATION: i0 I 3 
(D! OTHER INFORMATION: nrte= 

are designated C, nut may be 0 ci T 


ides 101 3 and 1014 


(xi) SEQUENCE DESCRIPTION: SEQ ID lb: : 


TTCCTTTCAA ATACACAC 


Met Gly 


■ t ara 

X O'cbl 

i 1’ 1 -:i 


AGC AGA AGC AAC ATO GOT CSC CA. Too 

Ser Arg Set Asn II- Ala Al g Hi:- Leu Gin Tnt 

-15 - 1 


> '-pi.-*-. /'• rp ' r-T-lr PT 1 


As:c L-.-i; II- Leu Phe 


TGT GTC GGT GCT GTG GGC GCC TGT ACT CTC TOT GTC ACA CAA CCG TGG 

Cys Val Gly Ala Val Gly Ala Cys Thr Leu Ser Yal Thr Gin Pro Trp 

1 c 10 

TAC CTA GAA GTG GAC TAC ACT CAT GAG GCC GTC ACC ATA n.N. jG. nCC 

Tyr Leu Glu Val Asp Tyr Thr His Glu Ala Val Thr lie Lys Cys Thr 

15 2 0 - - 



BATES 


• • 



TTC 

TCC 

GCA 

ACC 

GGA 

TGC 

CCT 

TCT 

GAG 

H .“V .“a .. .. .—v 

ACA TGC 

CTG 

TGG 

TTT 

24 S 


Phe 

Ser 

Ala 

Thr 

Gly 

Cy s 

Pro 

Ser 

Glu 

Glr. Pro 

Thr Cys 

Leu 

Trp 

Phe 







T C 





’ - 



^ c 



5 

CGC 

TAC 

GGT 

GCT 

CAC 

CAG 

r' f~' <t 

GAG 

AAC 

TT -, g 

TTG GAC 

GG G 

TGC 

AAA 

29 6 


Arg 

Tyr 

Gly 

Ala 

His 

Gin 

Pro 

Glu 

Asn 

Leu Cys 

Leu Asp 

C' i .. 

Cys 

s 






5 0 





5 5 



c- 0 





AGT 

G Au» 

GCA 

GAC 

rt.-vo 

TTC 

ACA 

GTG 

AGG 

GAG GCC 

CTC AAA 

G e\e\ 

AAC 

C 

344 

10 

Ser 

Glu 

Ala 

Asp 

Ly 

Phe 

Thr 

Va 1 

Arg 

Glu Ala 

Leu Lys 

Glu 

Asn 

Gin 





65 





7 0 



7 5 






GTT 

TCC 

CTC 

ACT 

GTA 

AAC 

AGA 

GTG 

ACT 

TCA AAT 

GAC AGT 

GCA 

ATT 

TAC 

392 


Val 

Ser 

Leu 

Thr 

7a 1 

Asn 

Arg 

Val 

Thr 

Ser Asn 

Asp Sei 

Ala 

He 

T-yr 


15 


80 





S 5 




9 L 1 






ATC 

TGT 

GGA 

ATA 

G ■_ -•-h 

TTC 

CCC 

AGT 

GTG 

CCG GAA 

--1 s. Vf rto /-v 

GCT 

AAA 

CAG 

440 


lie 

Cys 

Gly 

He 

A1 a 

Phe 

P r c 

S e r 

Va 1 

Pro Glu 

.-g i a e. r g 

r\ — cl 

Ly s 

G1 n 



95 





-T f, ", 




10 5 




11 0 


20 

ACA 

GGA 

G'Grt 

GGG 


AC- 

CTG 

GTG 

GTA 

h'’ 2. .'0 2 

ATT AAG 

C TG 

Ctc 

AGC 

438 


Thr 

Gly 

G1 v 

Glv 

Thr 

Thr 

Leu 

■/a 1 

' ; 1 

A 1C! Glu 

lie Lys 

Leu 

Leu 

S e r 







115 





120 



125 



25 

AAG 

GAA 

CTG 

CGG 

AGC 

TTC 

CTG 

ACA 

GCT 

CTT oTm 

TCA CTG 

C T o. 

TCT 

GTC 

536 


Lys 

Glu 

Leu 

Arc: 

Ser 

Phe 

Leu 

Th r 

Ala 

Leu '.'a 1 

Ser Leu 

Leu 

S e r 

\ a 1 






13 0 





13 5 



1 4 0 





TAT 

GTG 

ACC 

GGT 

GTG 

TG C 

GTG 

Go v- 

TTC 

ATA CTC 

CTC TCC 

AAA 

TCA 


584 

30 

Tyr 

Val 

Thr 

Gly 

7a 1 

Cys 

Va 1 

Ala 

Phe 

lie Leu 

Leu Ser 

Ly s 

Ser 

Lys 





145 





j.5 0 



155 






TCC 

AAC 

CCT 

CTA 

A.GA 

AAG 

AAA 

GAA 

i\ T 1 ' 
i r\ 

AAA GAA 

GAC TCA 

C AA 

A AG 

AAG 

632 


Ser 

As n 

Pro 

Leu 

Arg 

T * f “ 

Lv s 

G 1 u 

He 

T - ..o 1 • : 

Asp Ser 

Gin 

Ly s 

Lys 


35 


160 





: z £ 




HO 






AAG 

AGT 

GCT 

CGG 

CGT 

ATT 

TTT 

V. .-A'--I 

GAA 

ATT CCT 

:/AA GAA 

CTA 

TAC 

CAT 

680 


Lys 

Ser 

Ala 

Arg 

A r g 

lie 

Phe 

G1 n 

Glu 

He Ala 

Gin Glu 

Leu 

T-yr 

His 



175 





1 c 0 




1 5 




190 


40 

AAG 

AG A 

CAT 

rjTQ 

G- A 


r rT 

C‘ A'G 

CCA 

<r~-r 

2 2^ O’ Z^T 


A A ' 

ACT 

723 


Ly s 

Arg 

His 

■.’a 1 

G1 u 

Thr 

Ar. 

G 1 r. 

G i !\ 

-V ^ >■ ■; 1 . ■ 

Lys n 31 ■■ 

As i. 

Asn 

Thr 







1 9 5 





2H- 



"> i', C 



45 

TAT 

bh A 

z* AC 

AG A 

AGA 

GTA 

CTT 

rr*''' r- 

A A C 

TAT GAA 

1-/: eo;. 

TAG 

A AAC' 

GTT 

777 


Tyr 

Glu 

As n 

Arg 

Arg 

Va 1 

Leu 

S e i 

Asn 

Tyr Glu 

Arg Pro 









2 1 C 1 





2 i c 








TTAATTTTC A 

rt 1 oA. 

AGTC 

/-■ rr> 
- i 


ATCC 

A e 

TC C VGAG 

CTATGGC 

AGT 

GTTA 

ATGAAC 

837 

50 

ATA 1 

TATC. 

AT C 

ALtCji i ■ 

CTTA 

AA A 

n.n.-v.n 

’ ■* T> - 

r\ r \o 

r 

aaaagac 

AAC 

TGGC 

TACAAA 

397 


GAA' 

GGATGTC 

nonn 

TGT A 

AG G 

nnrt'v 


r-\ a. 


TTACCAA 

> * i~T' 

X 

A C TAAAAC C C 

957 

55 

— 

rtrtv. 

AA.nA 1 

r T'("’ r' 

.-.ACT 

g r r 

A A T 

A CCT 

TC C A 

.-v ATTT'-G . A. 

2222222 

ATT 

CTATTCCAAA 

1017 





5 


BATES, 


(2) INFORMATION FOR SEQ ID HO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 24i amino acids 
(&, TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 


(xi) SEQUENCE DESCRIPTION: SEQ ID 130:8: 


Met Gly Thr Ala Ser Arg 
-22 -20 

Leu lie Leu Phe Cys .ai 


n He Ala Arg His Leu Gin Thr Asn 
5 " 1 0 

a Yai Giv Ala Cys Thr Leu Sei Val 


Thr Gin Pro Tip Tyr 


Lei: Glu Ya 


;lu Ala Val Thr 


He Lys Cys Thr Phe Ser Ala Tr 


,i- < 3 iv c.'s Pro Ser Glu Gin Pro Thr 
/c 4 0 


Cys Leu Trp Phe Arg Ty 


Vr Glv Ala His Glr. Pro Glu Asn Leu Cys Leu 


Asp Gly Cys Lys Ser Glu Ala Asp Lys Phe Thr Yai Ary Glu Ala Leu 
6 0 £ - 

Lys Glu Asn Glr. Yai Ser Leu Thr Yai Asn Arg Yai Thr Ser Asn Asp 


Ser Ala lie Tyr lie Cys Gly lie Ala Phe 


P • v C -i' 1' "v ci 1 P 1' O Glu ^ 1 cl 

105 


Arg Ala Lvs Gin Thr Gly Gly Gly Th 

110 h 


r Thr Lei; Yai Yai Arc: Glu lie 


Lys Leu Leu Ser Lys Glu Leu Ary -:-r 
12 5 1 • 


Pi.- Leu Thr Ala Leu Yai Sei 


Leu Leu Ser Val Tyr Yai Till Gxy -ar cys 
140 145 


al Ala Phe He Leu Leu 


Ser Lys Ser Lys Ser 
155 

Ser Gin Lvs Lys Lys 


Asr: Pro Leu Arg Lys Lys Glu He Lys Glu Asp 

i ft. n- HO 


Ala Arg Arc: lie 7 -he Sir. Glu He Ala Glr. 

I 3 5 




5 


Glu Leu Tyr His Lys Arg His Val Glu Tr.i 
190 15S 


;lr. Gin Ser Glu Lys 


Asp Asn Asr. Thr Tyr Glu Asn Arg Arg val Leu eel A 
205 210 2 


sn Tyr Glu Arg 


(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1253 base pairs 
(E) TYPE: nucleic acid 
(C) S TR ANDEDHESS: single 
(D; TOPOLOGY: lineal 

(ii) MOLECULE TYPE: cDNA 


(ix) FEATURE: 

(A) NAME/KEY : CDS 
(BI LOGAT ION: 3~. 5 0 

(ix) FEATURE: 

(A) NAME/KEY: ixiat_pepc id‘| 
(E) LOCATION: 103.."50 


(xi) SEQUENCE DESCRIPTION: SEQ ID MO:9: 


CCACGCGTCC GGGAAAAC 


CAGCG ATS GGC CCT oTG AOe ee 
Met Glv Pro Val Ser Th 


AGC AGG AGG GGC CTC CGG C * G 
Ser Arg Arg Gly Leu Arg Lei: G 
- 15 ' 1 0 


TG ATC CTT CTT CAA GTT 
lie Leu Leu Gin Yal 


GGT GTG GTG GGC GCC TGT ACT GTA 

Gly Val Val Gly Ala Cys Thr '-'a 1 

1 - 

GAG GTG GAC TAC ACG TCT CAG ACT 

Glu Val Asp Tyr Thr Ser Gin Thi 

2 0 


G„i Val Leu Gin Pro Gly Tyr Leu 

: 15 

GCC ACC ATG GAG TGT ACC TTT TCT 
• • 1 eu-> Met G11: Cvs Till Phe ier 


ACA ACT GGA TGC CCT GCA GTG 


TTG TGG TTT CGC TGT 


Thr Thr Gly Cvs Pro Ala Val Gin Pro Lys ,«r Leu Tip Phe mg - 
-y 4 0 45 


GGC ACT CAC CAG CCT GAA GCT CTG TGC TTG GAC GGA TGC AGA AAT GAG 
Gly Thr His Gin Pro Glu Ala Leu Cys Leu Asp Gly Cys Arg Asn Glu 


55 


BAT: 


10 


15 


20 


25 


30 


35 


40 


45 


50 


55 


Ala Asp Lys Phe Thr Val Lys Glu Thr Lei; 
6 5 " 0 


CTC ACT GTT AAC AGG CTG 


GAC 


-.sr Gin 


gc; 


Leu 

Thr 

Val Asn 

Arg Leu 
85 

oe i 

Pro Asn Asp 
c r, 



GGA 

ATA 

GCA TTT 

CCC AAT 

C_r 

CCG GTA CCA 

r-._ GO 

CCC 

Gly 

lie 

Ala Phe 

10 0 

Pro Asn 

'O * U 

Pro Val Pro 
10 5 


Ala 

GAC 

GGG 

ACT ACA 

CTG GTG 

GTA 

vG l Cj ■ctC. 

r P r T l 

TTC 

Asp 

Gly 

Thr Thr 

115 

Leu Val 

Va 1 

Arg Glu Arg 

1 2 0 

Leu 

Phe 

CAC 

AGT 

CTC CTG 

» m -* r-'TC' 

CTC 

TTA GCA CTG 

CTC 

GCA 

His 

Ser 

130 

Leu Leu 

He Val 

Leu 

Leu Ala Leu 


r\ 1 cl 

14 0 

GGT 

GTG 

rp G rr ' GTG 

rprpp- 

ATA 

GTC CTC TTC 

AGA 

TC A 

Gly 

145 

v’a 1 

Cvs Vet 1 

I1e Phe 
15 0 

lie 

Val Leu Phe 

A r a 

i £ C. 

G e r 

CCA 

AGA 

AGO AGA 

GAA ACC 

AAG 

GAA GAC TCG 

AAA 

AAG 

Pro 

Arg 

Ser Arg 

Glu Th r 

Lys 

Glu Asp Ser 

Lys 

Lv s 


.n'o 1 '- /-V-_ 


12 c 


AAA TCT 


CGT ATC TTC CAG GAA ATT GOT CAA 
Glr 
1 S i 


Arg He Phe Gin Glu He Ala Gin Glu Leu 


r\ - 

HlS L-iV s 


1 G I 


ksr- Cl* 


GAA ACA AGT CAT CAG CCT GAG C; 

Glu Thr Ser His Gin Pr Glu Gi 

19 5 2 '"1 

GCA CTC CCC AGC CCT GGA AGA CCA TAGATGTG' 

Ala Leu Pro Ser Pro Giy Arg Pro 

210 215 

GACAGTGCAA CTCCAGAATC TATGGCAGTG TGAAT.CC 

CAAAGAGAGC TGAGGTGTAG CTTGAGTGGC AAAGTGC 

AGCTTTGATC CTCAGCACCA CATAA CTCAG C««AgTg. 

TGTGTAGTAG TATAAAAAGT CAGAAGTTCA AGGT'nT 

AGTCAGAGAC ATGTTATCTT GTCTCAAAAA CACTGCC 

CAAGTGGGAA AACAGCCAGT CACGCCAGAA GGCAGAG 

AATGATGGAA TGTGAAAACC TCAAGAAAAC TCAACTG 

AGG A A C AG T C T.-h.-v^G.-W ^ ^ ^ r.TTT * n.n.-.o 


TAT 'on.-i rtB.c 
Tvr Glu Asr 


V CL 1 ^01 

8 0 

342 

ATC TGT 
lie Cy5 

95 

3 90 

ACT GGA 

Thr Giy 

4 3 8 

GAG GTG 

Glu Val 

486 

GTC ACC 

Va1 Thr 

534 

AAC ACT 

Asn Thr 

160 

582 

GCT CGA 

Ala Arg 

175 

630 

TAT GTG 

T/r Val 

678 

AGA aiAA 
. Arg Lys 

726 

1TAAACCATT 

730 

CAAACAACAG 

840 

ATGAAGTCTT 

900 

TTCCCAACAT 

960 

ATG AACCTGA 

1020 

AAGGGCAGGA 

1080 

CACGAACCAT 

1140 

CTAATTTTCC 

1200 

TTA 

1253 



BATE: 



10 


15 


20 


25 


30 


35 


40 


45 


50 


(2) INFORMATION FOR 3EQ 10 NO:!. 1 : 

(i; SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: prcteir. 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 


; 1 v Leu Arg Leu Gly He 


Met Gly Pro Yal Ser Thr Ser Arg Arg Hy 

-22 -20 -15 


' - 1 Cl'- 


Leu lie LtU Leu a - n . i 

-5 1 


Glv Ala Cys Thr Yal Ser Yal 

C 1 0 


Leu Gin P 


ro Glv Tyr Leu Glu Yal Asp Tyr Thr O'er Gin Tin Yal Thr 


Met Glu Cys Thr Phe Ser Thr 


’ Cvs Pro A let Veil G1 r. Pro Ly; 


:nr G^y 


Ser Leu Trp Phe 
4 5 

Asp Gly Cys Arg 

6 0 


1 Cvs Leu 


ng Cvs Gly Thr His Gin Ire Glu Ala Leu 


?, i .. ,- r , r • -T-- p-'r,^ ^hr Veil Lvs Glu Thr Leu 

,SIi GlU lei cUr>p tlirr * . 


Asp Gin Asn Arg Va1 Per 
7 5 8 0 


L e u Th r ’1 a 1 A s: i A r g Leu 


rtsri nsp 


Ser Ala lie Tyr lie uys o 1 y 


11 * Ala Phe Pro Asn Glu Pro Va a Pro 

10 5 


Thr Ala Lys Gin Thr Gly Asp Gly Thr Thr Leu Y„1 Yal Ary Glu Arg 

no 1 - ~ - 12 ' 


Leu Phe Ser Arg Gin ..Hr Hr■ " 
125 i: 

Leu Ala Val Tyr «al .hi H g ■ 


Leu Ala Leu 


140 


1 4 c 


Phe IT 

i r : 


Leu Phe 


Arg Ser Lys Ser Asn Thr Pro Arg 
155 HO 


Lys Lys Lys Sei 


Ala Arg Arg lie Phn 


Ara Glu Thr Lys Glu Asp Ser 


Ala Gin Glu Leu 


Ty^ His LvAry T:i Yu 1 Glu 
12C 


•a V - ; 1 


■SI 


55 


BATES, ec ai. 


Asn Tyr Glu Asr. Arg Ly 


L'.'S Ala Leu Pr 


(2) INFORMATION FOR SEQ ID NO:i- : 

(i) SEQUENCE CHARACTERISTICS: 

( A\ LENGTH: 31^2 base pa : 

! B > TYPE: riuclric arid 

(C) STF.ANDEDNESS : single 

(D) TOPOLOGY: linear 

( 11 ) MOLECULE TYPE: cDNA 


FEATURE: 

(A; NAME KEY: CLG 
i p , t t ON : 41- 


( A) NAME 'KEY: ir.a t _p e p t 

(E) LOCATION: 112..1290 

FEATURE: 

(A) I :A.ME / KEY : R". i s c_ £ e a r u r • 

(E) LOCATION: 481 

(Di C-THEF. INFORMATION: r.: 


FEATURE: 

(A! NAME 'KEY: :nisc_ffri 
(E> LOCATION: 994 
(D) OTHER INFORMATION: 


rnav ):*# 


FEATURE: 

(B) LOCATION: 49.. 51 
T: OTHER INFORMATION: no 


■den might !■ 


[xii S EOU ENC E LEG C K I FT I 0 N : 




AT'G CCC 
Met Pro 


CTC AGC GCG GCti oCu oue, ^ 

Leu Ser Ala Ala Ala Ala Leu P 
-15 


, -,-,r T , ■ rjr'.g TCC "l 1 0 GC'C OTA ATT TTG CAC 

i ^, U pha Ala Ser Leu Ala Va 1 lie Leu His 


_ , ,, -t. , ■* /•** t. <T> rp T. *T* 

GAT GGC AGT CAm «To .--G.-. o..■ 1 • - ‘ 

Asp Gly ser Gin Met Arg Ala Lys Ala Phe Pro Glu Thr Ary «sp Tyr 


TCT CAA CCT ACT GO. 
Ser Gin Pro Thr Ai 


'T GTC 


La Thr '.'a 1 Glu Asp- lie Lys Lys Pro a 1 


55 


BATES 


CAG CAA CCA GCT A; 
Gin Gin Pro Ala L\ 
30 




Glr. Ala Pro His Gin Thr 


ATG GAT GGT CAT ATC ACC TTT C? 
Met Asp Gly His lie Thr Phc G; 


iiCA GCG GCC A-A GTA 
rhr Ala Ala Thr Val 


Ara Phr 


ITT CCA 
:le Pro 


ACA ACT ACC CCA GCA ACT ACA AAA AAC ACT GCA - 
Thr Thr Thr Pro Ala Thr Thr Lys Asn Thr -la .n; 
65 70 


■* r~~ g T'C' 


ACC TAC ACC CTG GTC ACA ACC CAG GCC ACA CCu ••• 

Thr Tyr Thr Leu Val Thr Thr Gin Ala Thr Pro --u»n .-%sn 


CCA ATT 
Pro He 


CAC ACA 
His Thr 


GCT CCT CCA GTT ACT GAA GTT ACA GTC GGv. --CT •-> " 

Ala Pro Pro Val Thr Glu Val Thr Val Gly Pro .T*r Leu Ala 
95 100 :j ‘ j 


TCA CTG CCA CCC ACC ATC ACC CCA CCA GCT v.-w 

Ser Leu Pro Pro Thr lie Thr Pro Pro Ala His Thr Tor A; 

110 117 12 ■' 

TCA TCA ACC GTC AGC CAC ACA ACT GGG AAC ACC ACT C«A CC< 

Ser Ser Thr Val Ser His Thr Thr Giy Asn Thr Thr Gin Pn 

130 135 


CAG ACC ACC CTT CCA GCA ACT TTA TCG ATA GCA CTG CAC A.-.A 

Gin Thr Thr Leu Pro Ala Thr Leu Ser He Ala Leu His Lys 

i 4 5 1 5 0 1 S 5 

ACC GGT CAC; AAG CCT GTT CAA CCC ACC Cal -’-..C 

Thr Gly Gin Lvs Pro Val Gin Pro Thr His Ala Pro t,*y Thi 

160 Hi l 77 

GCT GCC CAC AAT ACC ACC CGC ACA GCT GCA CT Guv. * v v «-'■> 

Ala Ala His Asn Thr Thr Arcs Thr Ala Ala Pro Ala Ser Thr 

175 131 


CCT TAT 


VCC AGT 
"hr Ser 


AGT AAC 
Ser Asn 
14 0 

AGC ACA 
Ser Thr 


GGG CCC ACC CTT GCA CCT CAG CCA TCG 
Gly Pro Thr Leu Ala Pro Gin Pro -ei 
19 0 1 P r 


TC AAG ACT GCG 
..1 Lvs Thr G1 


ACG GCA 


GTT CCT 
Val Pro 


ATT TAT 
lie Tyr 


CAG GTT CTA AAC GGA AGC AGA CTC TGT ATA AAA GCA Gao «Tg 
G in Val Leu Asn Gly Ser Arg Leu Cys II- Lys Ala Glu Met 


GGG ATA 
G1 v He 


CAG CTG ATT GTT CAA 
Gin Leu lie Val Gin 


A :7< V. > LVS 




Pro nicj 


Arg Tyr 



BATES 


10 


15 


20 


TTC AAC ATC GAC CCC AAC 
Phe Asn He Asp Pro Asr. 
24 0 

CGA AAA TCC AAC CTT CTG 
Arg Lys Ser Asn Leu Leu 
255 

ACA TTT ACC AAG GAT 


AAC TOT GGC 


TTf, TT. CAC GC* 

Leu Asr. Phe Gin 31: 


rp ^ rp T 1 X i nr ' AGG 


. <:i 1 r\ 




Thr Phe Thr Lys Asp Glu Glu Ser Tyr Tyr 


Glu Vn1 Glv 


j-\ i i 
hQC. 


27 0 


TAT TTG ACC GTC TCA GAT CCA GAG ACA aT - T-v- C.-^.-^ -H- ~'^ T 

Tyr Leu Thr Val Ser Asp Pro Glu Tnr lie - r Gj..i I*e Lys 

2 ^ H Cl Cl 5 M ‘J 


GCG GTG GTG ATG 


^ r ^ f' » /-■rnr- GGG “ T TCC TTC 

Ua Val Val Met Phe Gin Thr Ala Val Giy His Ser Phe 
~j r, c. s i v 


Va i 


AGT GAA CAG 


:TC CAG TTG TCA GCC CAC 


Ser Glu Gin Ser Leu Glr. Leu 


via His Le' 


Vo GTG n.-.n cA. CAT T 
In Val Lvs Thr Thr 


25 


30 


35 


40 


45 


50 


55 


GAT GTC CAA CTT CAA GCC TTT GAT TTT 

Asp Val Glr: Leu Glr: Ala Phe Asp Phe Glu a: 


335 


/— -r, m r' * r' Tr r 
bnl o.aT 


G TG f on’i' 'o .‘-vT 

Va1 Asp Glu 

3 50 


n-'-n 'T’g'T- Qpc TAC ACA ATT GT' 
Ser Ser Asp Tyr Thi He Va 

o c c 6 


VC CAC TTT GGA A AT 

5*' ; His Phi»? Gly Asn 


ry. r-p.^v A^T GGG 
; Pro Val lie Gly 


-u 


i 6 5 


GCC ATC GTG GTT GGT CTC TGC CTT ATC; GGT ATC 
Ala lie Val Val Gly Leu Cys Leu Met Gly !!V 


IT GTC TAT AAA ATC 
ly Val Tyr Lys lie 

~J C; i~i 


CGC CTA AGG TGT CAA TCA TOT GGA TAC CAG AOA ATC TAATTGTTgv. 

Arg Leu Arg Cys Gin Sei r u 1 y - 1 “•* 1 ~ ■ 

3 8 5 ° 0 

CCGGGGGGAA TG AAA AT A AT GGAATTTAGA GAACT. T . . . .-v. ... i . CC.«g -’■•’-'■’""T 1 ' 11 

GGAAATTCCC TCAGAGTGTG GGTCCTTCAA ACAATGTAAA C...e.aLT. LiftTTvnmT 

GAAGTGAGTC ATGTGT'GATT TAAGTTC'AGG CAGcV.r .~.C «- • i - *«««* ' ,Tl 

TATTTTATGA AAGATATAGT GAGCTGTTTA TTTTCTAGTT TC CTTTAGAA TATTTTAGCC 

ACTCAAAGTC AACATTTGAG ATATGTTGAA TTAACATAAT ATATGTAAAG TAGAATAAGC 

L . . r-.-p-.rp- . -T,- -rp, —..-tvjtGTGC ATTGAAGATT 

CTTCAAATTA TAAACCAAGc, u,T^m«;T.,T« «•. 

.-.-T-T. -r. - T.--i^,c;AC TTTCAGTGCT 

TTATTTTACC C.Hi.an . i ^ '. 

TTT ACT ATC T GTGTTTT AT 0 G TTT C AT 0; T A ACATAC'ATAT TCCTOGT'-GTA .JCA.TW-T 


913 

966 

1014 

10 6 2 

1110 

1158 

1206 

1254 

1300 

1360 
1420 
14 30 
1540 
16 0 0 
16 60 
1720 
1730 


BATES, 


CCTTTTCCAC TTTAAATTTG TTTTTGTTTT 


GCTGGAGTAC 


TCTCGGCTTA 


ttctcctgct TCAGCTTCCC GAGTAG 


AATTTTTGTA TTTTTATTAT 




GTCTTGAACT 


CTTGACCTCA GGTGATCCA 


-• -* ^ r 'm r-' ' r~ 
_ r-*. o 


CCATTGCGCC CGGCCTTAAA TGTTTTTTTT 


TGTCTAAGTG 


TGATGACTC 


A A AC AAT A AC 


GG C T A AG C AC 


v FT-/-’ - ^ omm rr-iT ^H-'rprprnrp AT'^ T 

TTTTT ATGC A A AAC«_ AA*J^iA j-. *-% i a x . m i /-% liixxi* a w 


G AG AC A A T A AT AATG' 




TTTATCTATA TCTCATTTCA TTCTCACAAC TTATAGGTGA ATGAGTAAAC 


TGAGACTTAA GGGAACTGAA TCACTTAAAT 
GCTTGAATTC ATGTTGGTCT GACATCAAGG 
TACAAGAACA ATGACACCAC ACTCTGCCTG 


GTCACCTGGC TAACTGATGG C AG AG C C AG A 


rr-i rprprp..—^/"’ I'J 1 ^ CTAC A 


3TTACC 


ACCTTACAGG 


CTTTGCAGAT 


AGGGGCTTTG 




-AGGCTCACA CCTCATACCA GCATACGCTC 
CATGAGACAT TAGGGTAGAT GAAAGGAGAG 
2, rn AAATCCTC CACTCTCTGG AAGGAGACTG 


'ATT A GTCAGTTGCT CATTTTTATG GGATTGCTTA GCTGGGCTGT 


AAAGATGAAG GCATCAAATA AACTCAAAGT 


-TTTTTAAAT TTTTTTGATA ATAGAGAAAC 


TTCGCTAACC AA 


n GTT r 'T rp TCTTGAGTGA TAGCCCCATC TTGTGGTAAC TTGCTGCTTC 


TGCACTTCAT ATCCATATTT CCTATTGTTC ACTTTATTCT GTAGAGCAGC CTGCCAAGAA 


TTTTATTTCT GCTGTTTTTT TTCCTGCTAA 


TAAGTCAGGA TGTTAACAGA 


AAAGTCCACA TAACCCTAGA A 


AAGGAATAAT TCAAGTCAGC CTAGAGA 


TGTTGACTTT 


rCTCATGTGT TTCCTTATC?A CTCAGTAAGT TGGCAAGGTC CTGACTTTAG 


ACATTGAATT GTAGTAAAGG TTTTTGTAAT AAAAACTTAC TTTgGAA/iAA 


AAAAAAAAAA A. 


(2) IMFORMATION FOF SEC IT HO:12: 


i ; 3E0UEMCE CHARACTERISTIC.: : 

I. A; LENGTH : 416 tv.. ivi-2 

'p. TYPE: i no &c i cl 


55 





- 



1 

1 

' 





• 

1 



BAT 

ES, 

et 




- 











( D) 

TC PCLOG 

Y: linea 










(id i 

•50 LEG 

'ULE TYPE 

: prate: 








5 


(x 

; i 1 ; 

SEQUENCE DESC 

T? p'p T 0 r ] • 

se-; 

) T r ; 

No: 

12 : 





Met 

Pro 

Arg 

Gin 

Leu Ser 

r. — Cl -L Cl 

i — Cl 

A 1 c: 

Leu 

Phe 

A 1 a 

S tr r 

Leu 


-23 



- 9 r* 



- 1 5 





-1 


10 

Val 

lie 

Leu 

HlS 

Asp Gly 

Ser Gin 

T * - ♦- 

r.c v. 

Arg 

Ala 

Ly s 

Ala 

Phe 

Pro 




_ 5 



1 




t: 





Thr 

Arg 

Asp- 

Tyr 

Ser Gin 

Pro Thr 

Ala 

A 1 ci 

Ala 

Thr 

Va 1 

G 1 ri 

Asp 


10 




15 









15 

Lys 

Lys 

Pro 

Va 1 

Gin Gin 

P r o A 1 a 

Li- s 

Gin 

A 1 : i 

Pi 

H r s 

G : n 

Thr 






1 0 



5 





-1 o' 


Ala 

A 1 a 

Ar g 

Phe 

Met As-f 

';lv H : s 

1 1 -r 

Thr 

Phe 


Th i 

A 1 ci 

» 1 - 
rt — ci 

20 




1 C 



5 ! 





c ; ~ 



Val 

Lys 

lie 

Pro 

Thr Thr 

Thr Pro 

A 1 a 

Thr 

Thr 

Lys 

A s n 

T n r 

Ala 




5 0 



£ c 





7 0 



25 

Thr 

Ser 

Pro 

I le 

Th r Ty r 

Thr Leu 

Va 1 

Thi 

Thr 

G 1 n 

Ala 

Thr 

Pro 



75 
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